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SYSTEM, METHOD, AND ARTICLE OF MANUFACTURE FOR A 
KNOWLEDGEBASE FRAMEWORK 



FIELD OF THE INVENTION 



The present invention relates to information management and retrieval and more 
particularly to frameworks for obtaining, managing, and providing information from 
a plurality of information sources. 



BACKGROUND OF THE INVENTION 



People who use computer systems and networks often need to look up information 
about the system they are using. Traditionally, information was stored in books and 
manuals, which were often kept physically near to the computer. If a user needed to 
look up information, he turned to a single soiirce-the paper manuals stored 
conveniently nearby. 

Currently, however, the amount of technical information available about a given 
computer system can be very large and can be stored at a wide variety of sources. 
Information is often provided to customers in "online" form, dispensing entirely 
with paper copies. This online information includes online databases, CD ROM 
databases, proprietary help systems, and online manuals. Large amounts of technical 
information are also available from third party online sources and from sources such 
as the World Wide Web. 



Amid an apparent wealth of online information, people still have problems finding 
the information they need. Onhne information retrieval may have problems 
30 including those related to inappropriate user interface designs and to poor or 

inappropriate organization and structure of the information. Storage of information 
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online in a variety of forms leads to certain information retrieval problems, several 
of which are described below. 

The existence of a variety of information sources leads to the lack of a unified 
5 information space. An "information space" is the set of all sources of infomiation 
that is available to a user at a given time or setting. When information is stored in 
many formats and at many sources, a user is forced to spend too much "overhead" 
on discovering and remembering where different information is located (e.g., online 
technical books, manual pages ("manpages"), release notes, help information, etc.). 
10 The user also spends a large amount of time remembering how to find information 
in each delivery mechanism. Thus, it is difficult for the user to remember where 
potentially relevant information might be, and the user is forced to jump between 
multiple different online tools to find it. 

1 5 The existence of a variety of information sources leads to information strategies that 
lack cohesion. Users currently must learn to use and remember a variety of 
metaphors, user interfaces, and searching techniques for each delivery mechanism 
and class of information. No one type of interface suits all users. Furthermore, a user 
may need different types of searching techniques and interfaces, depending on the 

20 circumstances and the natiire of the specific information needed. 

The existence of a variety of information sources leads to lack of links between 
sources of information. Conventional dehvery mechanisms often support only 
loosely stiiictured navigation, such as keyword search or hyperhnks. Such 
25 mechanisms provide the user with only a local organization of information instead 
of providing a global picture of the information space. 

The existence of a variety of information sources leads to j&ustration if the 
information uses a wide variety of terms or uses terms not familiar to the user. In 
30 addition, users employ concepts and terms differently than technical writers and 
authors. Conventional delivery mechanisms often rely on a keyword search as a 
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primary means of finding information. If the user's vocabulary does not sufficiently 
overlap with indices employed by a dehvery mechanism, a keyword search will 
result in a high percentage of disappointing and thistrating "term misses." The only 
recovery method for a failed keyword search is simply to guess at better query. 

5 

The existence of a variety of information sources leads to titles and descriptions of 
the information that are not intuitive to a user. Users often conceptually group and 
describe problems differently than do information organizers and writers. If, for 
example, a user does not know the title of a book or the name of a database, he may 
10 not be able to find the information stored therein. 

As computer systems become more complex and as sources of online information 
proliferate, it becomes more and more difficult for users to locate the information 
they need. Even worse, users may not always be aware of all the existing sources of 
1 5 information. Moreover, certain users may not use certain sources of information, 

even though they are aware of them, if they are not familiar with the interface or find 
it too difficult to use. 
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SUMMARY OF THE INVENTION 

A system, method and article of manufacture are provided for a knowledgebase 
framework. Information is obtained from at least one source utilizing a network. 
5 Utilizing a knowledge model, an index is generated for the obtained information. 
The generated index includes a plurality of items each associated with at least some 
of the obtained information. Utilizing the network, the generated index is displayed 
to a user. The user is permitted to select an least one of the items of the index. The 
information associated with the selected item is then displayed to the user utilizing 
10 the network. 

hi an aspect of the present invention, one of the sources from which information is 
obtained maybe an internal source. In another aspect of the present invention, one 
of the sources from which information is obtained may be an external source 

15 accessible utilizing a wide are network. In a further aspect of the present invention, 
the information obtained from the sources may include pharmaceutical information, 
hi yet a fiirther aspect of the present invention, displaying of the information 
associated with the selected item (or entry) to the user may also include utilizing the 
network to retrieve the associated information from the source from which the 

20 associated information was obtained. In even another aspect of the present 

invention, the network maybe capable of commvmicating using TCP/IP protocol. 

In an embodiment of the present invention, the network may be utilized to monitor 
one or more of the sources for updated information relating to one or more items in 
25 the index. In such an embodiment, when updated information is detected at one of 
the knowledge sources, a notice may be generated regarding the updated 
information. This notice may then be fransmitted to the user utilizing the network to 
notify the user of the updated information. As an option, the user may be allowed to 
select the source(s) to be monitored for updates or other changes. 

30 
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In another embodiment of the present invention, the user may be permitted to input a 
search term utiHzing the network. The index may be searched for items associated 
with the search term. Items of the index associated with the search term may then be 
displayed to the user utiHzing the network. 



BRIEF DESCRIPTION OF THE DRAWINGS 



The invention will be better understood when consideration is given to the following 
detailed description thereof. Such description makes reference to the annexed 
drawings wherein: 

Figure 1 is a schematic diagram of a hardware implementation of one embodiment 
of the present invention; 

Figure 2 is a schematic diagram of a knowledgebase framework in accordance with 
an embodiment of the present invention; 

Figure 3 is a schematic diagram of the knowledgebase framework illustrating how 
the index may be used as a user's primary gateway to a wide variety of information 
sources in accordance with an embodiment of the present invention; 

Figure 4 is a schematic diagram of an illusfrative knowledge model in accordance 
with an embodiment of the present invention; 

Figure 5 illusfrates a flowchart for a process for a knowledgebase framework in 
accordance with an embodiment of the present invention; 

Figure 6 illustrates a flowchart for a process for a knowledge model in accordance 
with an embodiment of the present invention; 

Figure 7 is a schematic illusfration of a log in frame for permitting a user to log into 
the knowledgebase framework utihzing a network in accordance with an 
embodiment of the present invention; 
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Figure 8 is a schematic illustration of a search and browse frame of the decision 
support application for permitting a user browse the knowledge model-based index 
in accordance with an embodiment of the present invention; 

5 Figure 9 is a schematic illustration of the items of the knowledge model-base index 
displayed in the search and browse frame upon selection of the cardiology/vascular 
diseases item in Figure 8 in accordance with an embodiment of the present 
invention; 

10 Figure 10 is a schematic illustration of the items of the knowledge model-base index 
displayed in the search and browse frame upon selection of the ACE biological 
target item in Figure 9 in accordance with an embodiment of the present invention; 

Figure 11 is a schematic illustration of illustrative items of the knowledge model- 
15 base index displayed in the search and browse frame upon selection of the captopril 
compound item in accordance with an embodiment of the present invention; 

Figure 12 is a schematic illusfration of a browser frame which displays information 
from knowledge source relating to an item upon selection of the visit source 
20 selection in accordance with an embodiment of the present invention; 

Figure 13 is a schematic illustration of an exemplary display of the display bar upon 
selection of a scientist/person item in accordance with an embodiment of the present 
invention; 

25 

Figure 14 is a schematic illustration of illustrative items of the knowledge model- 
base index displayed in the search and browse frame upon selection of a patent item 
in accordance with an embodiment of the present invention; 

30 Figure 15 is a schematic illusfration of a browser frame which displays information 
from knowledge source relating to the selected patent item of Figure 14 upon 
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selection of the visit source selection in accordance with an embodiment of the 
present invention; 

Figure 16 illustrates a recent news frame that the user may access after logging on to 
5 the network in accordance with an embodiment of the present invention; 

Figure 17 illustrates a flowchart for a process for monitoring information in a 
knowledgebase framework in accordance with an embodiment of the present 
invention; 

10 

Figure 18A is a schematic illustration of a process map displayed in a research frame 
for use in a knowledgebase framework in accordance with an embodiment of the 
present invention; 

15 Figure 18B is a schematic illustration of an illustrative template for a process map 
displayed in the research frame in accordance with an embodiment of the present 
invention; 

Figure 18C illustrates an inbox for managing email messages displayed in an 
20 organizer frame for use in a knowledgebase framework in accordance with an 
embodiment of the present invention; 

Figure 18D illustrates a calendar for managing events and appointments of a user 
displayed in an organizer frame for use in a knowledgebase framework in 
25 accordance with an embodiment of the present invention; 

Figure 19 is a schematic diagram of an exemplary system architecture in accordance 
with an embodiment of the present invention; 

30 Figure 20 is a flowchart of the system in accordance with an embodiment of the 
present invention; 
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Figure 21 is a flowchart of a parsing unit of the system in accordance with an 
embodiment of the present invention; 

5 Figure 22 is a flowchart for pattern matching in accordance with an embodiment of 
the present invention; 

Figure 23 is a flowchart for a search unit in accordance with an embodiment of the 
present invention; 

10 

Figure 24 is a flowchart for overall system processing in accordance with an 
embodiment of the present invention; 

Figure 25 is a flowchart of topic processing in accordance with an embodiment of 
15 the present invention; 

Figure 26 is a flowchart of meeting record processing in accordance with an 
embodiment of the present invention; 



20 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Embodiments of the present invention show how the concept of knowledge 
integration can be applied in the business world, especially in the pharmaceutical 
5 industry. Aspects of the present invention may be targeted for users active in the 
drug discovery process such as scientist and other researchers. Embodiments of the 
present invention may use knowledge integration technology to semantically 
integrate the knowledge capital located in various isolated repositories in the 
Internet. The information jfrom this repositories are extracted and are classified 
10 based on various facets such as, for example drug, chemical compound, biological 
target, scientist, etc. As the results, embodiments of the present invention can 
graphically show users how the various facets of the information are related to each 
other. 

15 An embodiment of a system in accordance with the present invention is preferably 
practiced in the context of a personal computer such as an IBM compatible personal 
computer, Apple Macintosh computer or UNIX based workstation. A representative 
hardware environment is depicted in Figure 1, which illustrates a typical hardware 
configuration of a workstation in accordance with a preferred embodiment having a 

20 central processing unit 110, such as a microprocessor, and a nimiber of other imits 
interconnected via a system bus 112. The workstation shown in Figure 1 includes a 
Random Access Memory (RAM) 114, Read Only Memory (ROM) 116, an I/O 
adapter 1 18 for connecting peripheral devices such as disk storage units 120 to the 
bus 112, a user interface adapter 122 for connecting a keyboard 124, a mouse 126, a 

25 speaker 128, a microphone 132, and/or other user interface devices such as a touch 
screen (not shown) to the bus 112, communication adapter 134 for connecting the 
workstation to a communication network (e.g., a data processing network) and a 
display adapter 136 for connecting the bus 112 to a display device 138. The 
workstation typically has resident thereon an operating system such as the Microsoft 

30 Windows NT or Windows/95 Operating System (OS), the IBM OS/2 operating 
system, the MAC OS, or UNIX operating system. Those skilled in the art will 
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appreciate that the present invention may also be implemented on platforms and 
operating systems other than those mentioned. 

A preferred embodiment is written using JAVA, C, and the C++ language and 
5 utiUzes object oriented programming methodology. Object oriented prograrmning 
(OOP) has become increasingly used to develop complex applications. As OOP 
moves toward the mainstream of software design and development, various software 
solutions require adaptation to make use of the benefits of OOP. A need exists for 
these principles of OOP to be applied to a messaging interface of an electronic 
] 0 messaging system such that a set of OOP classes and objects for the messaging 
interface can be provided. 

OOP is a process of developing computer software using objects, including the steps 
of analyzing the problem, designing the system, and constructing the program. An 

1 5 object is a software package that contains both data and a collection of related 

structures and procedures. Since it contains both data and a collection of structures 
and procedures, it can be visuahzed as a self-sufficient component that does not 
require other additional structures, procedures or data to perform its specific task. 
OOP, therefore, views a computer program as a collection of largely autonomous 

20 components, called objects, each of which is responsible for a specific task. This 
concept of packaging data, structures, and procedures together in one component or 
module is called encapsulation. 

In general, OOP components are reusable software modules which present an 
25 interface that conforms to an object model and which are accessed at nm-time 

through a component integration architecture. A component integration architecture 
is a set of architecture mechanisms which allow software modules in different 
process spaces to utilize each others capabilities or fiinctions. This is generally done 
by assuming a common component object model on which to build the architecture. 
30 It is worthwhile to differentiate between an object and a class of objects at this point. 
An object is a single instance of the class of objects, which is often just called a 
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class. A class of objects can be viewed as a blueprint, from which many objects can 
be formed. 

OOP allows the programmer to create an object that is a part of another object. For 
5 example, the object representing a piston engine is said to have a composition- 
relationship with the object representing a piston. In reality, a piston engine 
comprises a piston, valves and many other components; the fact that a piston is an 
element of a piston engine can be logically and semantically represented in OOP by 
two objects. 

10 

OOP also allows creation of an object that "depends from" another object. If there 
are two objects, one representing a piston engine and the other representing a piston 
engine wherein the piston is made of ceramic, then the relationship between the two 
objects is not that of composition. A ceramic piston engine does not make up a 

15 piston engine. Rather it is merely one kind of piston engine that has one more 
limitation than the piston engine; its piston is made of ceramic. In this case, the 
object representing the ceramic piston engine is called a derived object, and it 
inherits all of the aspects of the object representing the piston engine and adds 
further limitation or detail to it. The object representing the ceramic piston engine 

20 "depends from" the object representing the piston engine. The relationship between 
these objects is called inheritance. 

When the object or class representing the ceramic piston engine inherits all of the 
aspects of the objects representing the piston engine, it inherits the thermal 

25 characteristics of a standard piston defined in the piston engine class. However, the 
ceramic piston engine object overrides these ceramic specific thermal 
characteristics, which are typically different from those associated with a metal 
piston. It skips over the original and uses new fimctions related to ceramic pistons. 
Different kinds of piston engines have different characteristics, but may have the 

30 same underlying fimctions associated with it (e.g., how many pistons in the engine, 
ignition sequences, lubrication, etc.). To access each of these functions in any piston 
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engine object, a programmer would call the same functions with the same names, 
but each type of piston engine may have different/overriding implementations of 
functions behind the same name. This ability to hide different implementations of a 
function behind the same name is called polymorphism and it greatly simphfies 
communication among objects. 

With the concepts of composition-relationship, encapsulation, inheritance and 
polymorphism, an object can represent just about anything in the real world, hi fact, 
one's logical perception of the reality is the only limit on determining the kinds of 
things that can become objects in object-oriented software. Some typical categories 
are as follows: 

• Objects can represent physical objects, such as automobiles in a traffic-flow 
simulation, electrical components in a circuit-design program, countries in an 
economics model, or aircraft in an air-traffic-control system. 

• Objects can represent elements of the computer-user environment such as 
windows, menus or graphics objects. 

• An object can represent an inventory, such as a persoimel file or a table of 
the latitudes and longitudes of cities. 

• An object can represent user-defined data types such as time, angles, and 
complex numbers, or points on the plane. 

With this enormous capability of an object to represent just about any logically 
separable matters, OOP allows the software developer to design and implement a 
computer program that is a model of some aspects of reality, whether that reality is a 
physical entity, a process, a system, or a composition of matter. Since the object can 
represent anything, the software developer can create an object which can be used as 
a component in a larger software project in the future. 

If 90% of a new OOP software program consists of proven, existing components 
made from preexisting reusable objects, then only the remaining 10% of the new 
software project has to be written and tested from scratch. Since 90% already came 
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from an inventory of extensively tested reusable objects, the potential domain from 
which an error could originate is 10% of the program. As a result, OOP enables 
software developers to build objects out of other, previously built objects. 

5 This process closely resembles complex machinery being built out of assemblies and 
sub-assemblies. OOP technology, therefore, makes software engineering more like 
hardware engineering in that software is built from existing components, which are 
available to the developer as objects. All this adds up to an improved quahty of the 
software as well as an increased speed of its development. 

10 

Programming languages are beginning to fully support the OOP principles, such as 
encapsulation, inheritance, polymorphism, and composition-relationship. With the 
advent of the C++ language, many commercial software developers have embraced 
OOP. C++ is an OOP language that offers a fast, machine-executable code. 

15 Furthermore, C++ is suitable for both commercial-application and systems- 
programming projects. For now, C++ appears to be the most popular choice among 
many OOP programmers, but there is a host of other OOP languages, such as 
Smalltalk, Common Lisp Object System (CLOS), and Eiffel. Additionally, OOP 
capabilities are being added to more traditional popular computer programming 

20 languages such as Pascal. 



The benefits of object classes can be summarized, as follows: 

• Objects and their corresponding classes break down complex programming 

problems into many smaller, simpler problems. 
25 • Encapsulation enforces data abstraction through the organization of data into 

small, independent objects that can commimicate with each other. 

Encapsulation protects the data in an object from accidental damage, but 

allows other objects to interact with that data by calling the object's member 

functions and structures. 
30 • Subclassing and inheritance make it possible to extend and modify objects 

through deriving new kinds of objects from the standard classes available in 
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the system. Thus, new capabiUties are created without having to start from 
scratch. 

• Polymorphism and multiple inheritance make it possible for different 
programmers to mix and match characteristics of many different classes and 

5 create specialized objects that can still work with related objects in 

predictable ways. 

• Class hierarchies and containment hierarchies provide a flexible mechanism 
for modeling real-world objects and the relationships among them. 

• Libraries of reusable classes are useful in many situations, but they also have 
10 some limitations. For example: 

• Complexity, hi a complex system, the class hierarchies for related classes 
can become extremely confusing, with many dozens or even hundreds of 
classes. 

• Flow of control. A program written with the aid of class libraries is still 
15 responsible for the flow of control (i.e., it must control the interactions 

among all the objects created from a particular library). The programmer has 
to decide which functions to call at what times for which kinds of objects. 

• Duplication of effort. Although class libraries allow programmers to use and 
reuse many small pieces of code, each programmer puts those pieces 

20 together in a different way. Two different programmers can use the same set 

of class libraries to write two programs that do exactly the same thing but 
whose internal structure (i.e., design) maybe quite different, depending on 
hundreds of small decisions each programmer makes along the way. 
Inevitably, similar pieces of code end up doing similar things in slightly 

25 different ways and do not work as well together as they should. 

Class libraries are very flexible. As programs grow more complex, more 
programmers are forced to reinvent basic solutions to basic problems over and over 
again. A relatively new extension of the class library concept is to have a 
30 framework of class libraries. This framework is more complex and consists of 
significant collections of collaborating classes that capture both the small scale 
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patterns and major mechanisms that implement the common requirements and 
design in a specific application domain. They were first developed to free 
apphcation programmers from the chores involved in displaying menus, windows, 
dialog boxes, and other standard user interface elements for personal computers. 

5 

Frameworks also represent a change in the way programmers think about the 
interaction between the code they write and code written by others. In the early days 
of procedural programming, the programmer called libraries provided by the 
operating system to perform certain tasks, but basically the program executed down 
1 0 the page from start to finish, and the programmer was solely responsible for the flow 
of control. This was appropriate for printing out paychecks, calculating a 
mathematical table, or solving other problems with a program that executed in just 
one way. 



1 5 The development of graphical user interfaces began to turn this procedural 

programming arrangement inside out. These interfaces allow the user, rather than 
program logic, to drive the program and decide when certain actions should be 
performed. Today, most personal computer software accomplishes this by means of 
an event loop which monitors the mouse, keyboard, and other sources of external 

20 events and calls the appropriate parts of the programmer's code according to actions 
that the user performs. The programmer no longer determines the order in which 
events occur. Instead, a program is divided into separate pieces that are called at 
unpredictable times and in an unpredictable order. By relinquishing control in this 
way to users, the developer creates a program that is much easier to use. 

25 Nevertheless, individual pieces of the program written by the developer still call 
libraries provided by the operating system to accomplish certain tasks, and the 
programmer must still determine the flow of control within each piece after it's 
called by the event loop. Application code still "sits on top of the system. 



30 Even event loop programs require programmers to write a lot of code that should not 
need to be written separately for every application. The concept of an application 
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framework carries the event loop concept flirther. Instead of dealing with all the 
nuts and bolts of constructing basic menus, windows, and dialog boxes and then 
making these things all work together, programmers using application frameworks 
start with working application code and basic user interface elements in place. 
5 Subsequently, they build from there by replacing some of the generic capabilities of 
the framework with the specific capabilities of the intended application. 

Application frameworks reduce the total amount of code that a programmer has to 
write from scratch. However, because the framework is really a generic application 
that displays windows, supports copy and paste, and so on, the programmer can also 
relinquish control to a greater degree than event loop programs permit. The 
framework code takes care of almost all event handling and flow of control, and the 
programmer's code is called only when the framework needs it (e.g., to create or 
manipulate a proprietary data structure). 

A programmer writing a framework program not only rehnquishes confrol to the 
user (as is also true for event loop programs), but also relinquishes the detailed flow 
of control within the program to the framework. This approach allows the creation 
of more complex systems that work together in interesting ways, as opposed to 
isolated programs, having custom code, being created over and over again for 
similar problems. 

Thus, as is explained above, a framework basically is a collection of cooperating 
classes that make up a reusable design solution for a given problem domain. It 
25 typically includes objects that provide default behavior (e.g., for menus and 

windows), and programmers use it by inheriting some of that default behavior and 
overriding other behavior so that the framework calls application code at the 
appropriate times. 

30 There are three main differences between frameworks and class libraries: 
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• Behavior versus protocol. Class libraries are essentially collections of 
behaviors that you can call when you want those individual behaviors in your 
program. A framework, on the other hand, provides not only behavior but 
also the protocol or set of rules that govern the ways in which behaviors can 
be combined, including rules for what a programmer is supposed to provide 
versus what the framework provides. 

• Call versus override. With a class library, the code the programmer 
instantiates objects and calls their member ftmctions. It's possible to 
instantiate and call objects in the same way with a framework (i.e., to treat 
the framework as a class library), but to take full advantage of a framework's 
reusable design, a programmer typically writes code that overrides and is 
called by the framework. The framework manages the flow of control 
among its objects. Writing a program involves dividing responsibilities 
among the various pieces of software that are called by the framework rather 
than specifying how the different pieces should work together. 

• Lnplementation versus design. With class libraries, programmers reuse only 
implementations, whereas with frameworks, they reuse design. A 
framework embodies the way a family of related programs or pieces of 
software work. It represents a generic design solution that can be adapted to 
a variety of specific problems in a given domain. For example, a single 
framework can embody the way a user interface works, even though two 
different user interfaces created with the same framework might solve quite 
different interface problems. 

Thus, through the development of frameworks for solutions to various problems and 
programming tasks, significant reductions in the design and development effort for 
software can be achieved. A preferred embodiment of the invention utilizes 
HyperText Markup Language (HTML) to implement documents on the hitemet 
together with a general-purpose secure communication protocol for a transport 
medium between the client and the Newco. HTTP or other protocols could be 
readily substituted for HTML without undue experimentation. Information on these 
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products is available in T. Bemers-Lee, D. Connoly, "RFC 1866: Hypertext Markup 
Language - 2.0" (Nov. 1995); and R. Fielding, H, Frystyk, T. Bemers-Lee, J. Gettys 
and J.C. Mogul, "Hypertext Transfer Protocol - HTTP/1 . 1 : HTTP Working Group 
Internet Draft" (May 2, 1996). HTML is a simple data format used to create 

5 hypertext documents that are portable from one platform to another. HTML 

documents are SGML documents with generic semantics that are appropriate for 
representing information from a wide range of domains. HTML has been in use by 
the World-Wide Web global information initiative since 1990. HTML is an 
application of ISO Standard 8879; 1986 Information Processing Text and Office 

10 Systems; Standard Generalized Markup Language (SGML). 



To date, Web development tools have been limited in their ability to create dynamic 
Web applications which span from client to server and interoperate with existing 
computing resources. Until recently, HTML has been the dominant technology used 
15 in development of Web-based solutions. However, HTML has proven to be 
inadequate in the following areas: 

• Poor performance; 

• Restricted user interface capabilities; 

• Can only produce static Web pages; 

20 • Lack of interoperability with existing applications and data; and 

• Inability to scale. 

Sim Microsystem's Java language solves many of the client-side problems by: 

• Improving performance on the client side; 

25 • Enabling the creation of dynamic, real-time Web applications; and 

• Providing the ability to create a wide variety of user interface components. 

With Java, developers can create robust User Interface (UI) components. Custom 
"widgets" (e.g., real-time stock tickers, animated icons, etc.) can be created, and 
30 client-side performance is improved. Unlike HTML, Java supports the notion of 

client-side validation, offloading appropriate processing onto the client for improved 
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performance. Dynamic, real-time Web pages can be created. Using the above- 
mentioned custom UI components, dynamic Web pages can also be created. 



Sun's Java language has emerged as an industry-recognized language for 
5 "programming the Internet." Sun defines Java as: "a simple, object-oriented, 
distributed, interpreted, robust, secure, architecture-neutral, portable, high- 
performance, multithreaded, dynamic, buzzword-compliant, general-purpose 
programming language. Java supports programming for the Internet in the form of 
platform-independent Java applets." Java applets are small, specialized applications 
10 that comply with Sun's Java Application Programming Interface (API) allowing 
developers to add "interactive content" to Web docximents (e.g., simple animations, 
page adornments, basic games, etc.). Applets execute within a Java-compatible 
browser (e.g., Netscape Navigator) by copying code from the server to client. From 
a language standpoint, Java's core feature set is based on C++. Sun's Java literatiire 
15 states that Java is basically, "C++ with extensions from Objective C for more 
dynamic method resolution." 



Another technology that provides similar function to JAVA is provided by 
Microsoft and ActiveX Technologies, to give developers and Web designers 

20 wherewithal to build dynamic content for the Internet and personal computers. 

ActiveX includes tools for developing animation, 3-D virtual reality, video and other 
multimedia content. The tools use fritemet standards, work on multiple platforms, 
and are being supported by over 100 companies. The group's building blocks are 
called ActiveX Controls, small, fast components that enable developers to embed 

25 parts of software in hypertext markup language (HTML) pages. ActiveX Controls 
work with a variety of programming languages including Microsoft Visual C++, 
Borland Delphi, Microsoft Visual Basic programming system and, in the fixture, 
Microsoft's development tool for Java, code named "Jakarta." ActiveX 
Technologies also includes ActiveX Server Framework, allowing developers to 

30 create server apphcations. One of ordinary skill in the art readily recognizes that 
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ActiveX could be substituted for JAVA without undue experimentation to practice 
the invention. 

To improve the decision making process, it may be helpful to deliver the right 
5 information to the right person at the right time. For example, the right information 
may include information from all parts of the organization and from external 
sources, information in the context of the business process (regardless of the source 
or format), and relevant information about business entities and relationships (rather 
than keywords and documents). Delivering the right information to the right person 

10 may involve filtering of the information based on needs of the individual, and 

delivery of the filtered information to the individual or team. The right time may 
mean providing up-to-date information and information on demand. 
Several challenges exist today that can make it difficult to meet these requirements. 
For example, both internal and external information may exist in different 

15 environments, platforms, formats such as proprietary databases, project reports and 
e-mail messages. Additionally, the underlying information repositories due to the 
heterogeneous nature, will need to remain unaltered because scientists and other 
business process participants store their information in diverse formats and the 
development of new apphcations using the repositories will continue, often in 

20 isolation. Further, traditional techniques of integration can be very time consuming 
to develop and often inflexible to rapid change. For instance, an average data- 
warehousing project typically takes between nine and twelve months to complete 
and most of these projects will typically only integrate structured information. Also, 
external information can be even a greater challenge: there are over one billion web 

25 pages (with this nxunber doubling every four months) and not all sites are useful or 
trustworthy. 

With embodiments of the present invention, the right information can be delivered to 
the right person at the right time. With embodiments of the present invention, the 
information can come from internal and external sources. The information can also 
30 be cleansed, integrated and placed in the right business context and also be 
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customized to meet an individual's particular needs. Embodiments of the present 
invention also allow information to be delivered proactively (i.e., "pushed"). 

One aspect of the present invention to help facilitate efficient collaboration by 
5 helping to allow the sharing of information with other team members and by 
providing a medium to communicate a set of well understood processes. 
Figure 2 is a schematic diagram of a knowledgebase framework 200 in accordance 
with an embodiment of the present invention. With such a knowledgebase 
framework, a wide variety of information sources may be extracted, cleansed and 
10 structured based on a knowledge model to create a highly customized index that can 
be accessed and browsed by a user. 

In closer detail, information may be contained in a plurality of internal sources 202 
and external sources 204. An internal source 202 of information is typically an 

15 information source that is an under the control of entity that employs the user and 
whose information may be proprietary to the entity. Litemal sources of information 
may include, for example: discovery information, PD information, clinical 
information, regulatory information, and M&S information. An external source 204 
of information is typically an information source that is not under the control of 

20 entity that employs the user. An external source may typically be accessible 

utilizing a wide are network such as the Litemet and World Wide Web. External 
sources may include for example: bio-analysis information, study management 
information, safety data information, market report information, and hitemet 
websites including government, public, and subscription based websites. 

25 

The knowledgebase framework may also include an index creator 206 which is 
connected to the internal and external sources 202, 204 by a network. The index 
creator 206 may also include or have access to a knowledgebase model 208. 
Utilizing the knowledgebase model 208, the index creator 206 may extract a wide 
30 variety information from the internal and external sources 202, 204, cleanse the 
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extracted information, restructure the extracted information and then reconcile the 
extracted information into a knowledge model -based index. 

The knowledgebase framework 200 may also include an index database 210 coupled 
5 to the index creator 206 for storing the knowledge model-based index created by the 
index creator 206. Users may then access the knowledge model-based index stored 
in the database 210 from a browser/portal 212 utilizing the network. As an option, 
the knowledgebase framework may also include a web server 214 or other similar 
type of computer for interfacing the browser/portal 212 with the database 210. 

10 

Additionally, the knowledgebase framework may include a decision support 
application 216 for helping a user determine what is the right information for the 
user and help the user receive the right information at the right time for the user. 
The decision support application 216 (in combination with the browser/portal 212) 

1 5 provides the user with the capability to browse and navigate through an integrated 
web of knowledge regardless the location of the knowledge sources. 
The decision support application 216 also allows the user to access internal and 
external information. The decision support application 216 may also be used to 
provide a user with information tailored for a specific process such as, for example, 

20 a drug discovery. The decision support application 216 may fiirther be used to help 
deliver the right information to the user by allowing them to monitor internal and 
external events at a wide range of granularity. 

Figure 3 is a schematic diagram of the knowledgebase framework illusfrating how 
25 the index may be used as a user's primary gateway to a wide variety of information 
sources 202, 204 in accordance with an embodiment of the present invention. From 
this index and with the assistance of the decision support application 216, a user has 
the ability to retrieve the original information source. Some illusfrative information 
sources may include a genomics database 300, a pre-clinical database 302, a clinical 
30 database 304, a departmental reports database 306, a subscription based information 
database 308, and/or market report database 310. Some of these information sources 
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may comprises exclusively internal or external sources while other information 
sources may comprise a combination of internal and external sources. In an 
illustrative example, as shown in Figure 3, the departmental reports database 306 
may exclusively comprise internal information sources, while the subscription based 

5 information database 308, and market report database 310 may comprise exclusively 
external information sources. Continuing with this illustrative example, the 
genomics database 300, the pre-clinical database 302, and the chnical database 304 
may all comprise a combination of internal and external information sources. 
Figure 4 is a schematic diagram of an illustrative knowledge model 208 in 

10 accordance with an embodiment of the present invention. The knowledge model 
comprises a plurality of inter-related items. In an illustrative embodiment of the 
present invention, as illustrated in Figure 4, the items of the knowledge model may 
include: a therapeutic area item 400, a target item 402, disease item 404, a scientist 
item 406, an organization item 408, a patent item 410, a compound item 412, a 

15 literature item 414, a FDA approval item 416, and a drug item 418. 

The inter-relations between the various items of the knowledge model are illustrated 
in Figure 4 by the plurality of connecting lines (e.g., connecting line 420)connecting 
each of the items to various other items. For example, the literature item 414 is 
20 directly related with the target item 402, the scientist item 406, the organization item 
408, the compound item 412, and drug item 418. Also shown in Figure 4 are the 
marmer or type of relationship 422 provided by the associated connecting line. For 
example, the relationship between the literature item and the scientist item is the 
term "Authored." 

25 

The knowledge model also helps to provide an organizational structure to the index 
generated in the knowledgebase framework so that the items of the generated index 
are arranged according to the organization structure. In one embodiment of the 
present invention, the organizational structure of the generated index maybe based 
30 on the inter-relations between the items of the knowledge model. 
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Figure 5 illustrates a flowchart for a process 500 for a knowledgebase framework in 
accordance with an embodiment of the present invention, hiformation is obtained 
from at least one knowledge source utilizing a network in operation 502. Utilizing 
a knowledge model, an index is generated for the obtained information in operation 

5 504. This generated index includes a plurality of items (or entries) each associated 
with at least some of the obtained information. The generated index may then be 
stored in an index database. UtiUzing the network, the generated index is displayed 
to a user in operation 506. The user is permitted to select an least one of the items of 
the index in operation 508. After receipt of the selection by the user, the information 

10 associated with the selected item is then displayed to the user utilizing the network 
in operation 510. 

In an aspect of the present invention, one of the knowledge sources from which 
information is obtained may be an internal source under the control of entity that 
1 5 employs the user and whose information therein may be proprietary to the entity. 
Some illustrative examples of internal sources include: a genomics database, a pre- 
clinical database, a clinical database, and/or a departmental reports database. 

In another aspect of the present invention, one of the knowledge sources from which 
20 information is obtained may be an external source (e.g., a website) accessible 

utilizing a wide are network such as the Internet and World Wide Web. In general, 
the external sources may not typically be under the confrol of entity that employs the 
user. Some illustrative examples of external sources include subscription based 
information, and/or market reports. 

25 

In a turther aspect of the present invention, the information obtained from the 
sources may include pharmaceutical information such as, for example, information 
relating to: a pharmaceutical therapeutic area, a pharmaceutical target, a 
pharmaceutical compound, a disease, a patent, the Federal Drug Administration 
30 (FDA) (such as information regarding FDA approval of a pharmaceutical), a person 
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researching or working on a pharmaceutical, and/or pharmaceutical literature such as 
a periodical. 

In an embodiment of the present invention, the network may be utilized to monitor 
5 one or more of the knowledge sources for updated information relating to one or 
more items in the index, hi such an embodiment, when updated information is 
detected at one of the knowledge sources, a notice may be generated regarding the 
updated information. This notice may then be transmitted to the user utilizing the 
network to notify the user of the updated information. As an option, the user may be 
10 allowed to select the knowledge source(s) to be monitored for updates or other 
changes. 

In another embodiment of the present invention, the user may be permitted to input a 
search term for searching the index utilizing the network. Upon receipt of the search 
] 5 term, the index may be searched for items associated with the search term. Items of 
the index associated with the input search term (i.e., that match the search term) may 
then be displayed to the user utilizing the network. 

In one aspect of the present invention, the items of the index may be organized and 
20 displayed in some sort of a hierarchical format such as, for example, a hierarchical 
tree format. In yet a further aspect of the present invention, displaying of the 
information associated with the selected item (or entry) to the user may also include 
utilizing the network to retrieve the associated information from the knowledgebase 
source (such as a website) from which the associated information was obtained. In 
25 even another aspect of the present invention, the network may be capable of 
communicating using TCP/IP protocol. 

Figure 6 illustrates a flowchart for a process 600 for a knowledge model in 
accordance with an embodiment of the present invention. A network is utilized to 
30 access or one or more knowledge sources in operation 602. Information from the 
knowledge source is then exfracted utilizing the network in operation 604. The 
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extracted information may have its own formatting. The format of the extracted 
information is converted in operation 606 into a common or standardized format 
(i.e., cleansing and restructuring). An index is generated for the standardized 
extracted information utiHzing a knowledge model in operation 608. The generated 
5 index is then stored in an index database in operation 610. 

hi one aspect of the present invention, the knowledge model may include a plurality 
of inter-associated or inter-related items. In such an aspect, generation of the index 
may include associating the extracted information with one or more of the items of 

] 0 the model, and then mapping the extracted information to the associated item. In 
this manner, when the index is displayed to a user, selection of the item by a user 
hnks the user to the associated information and the source of the information. As an 
option, the items of the knowledge model may include a therapeutic area item, a 
target item, disease item, a scientist item, an organization item, a patent item, a 

15 compound item, a literature item, a FDA approval item, and/or a drug item. 

In even another aspect of the present invention, the knowledge model may also 
provide an organizational structure to the generated index so that the items of the 
generated index are arranged according to the organization structure. As an 

20 illustrative example, the organizational structure may be a hierarchical tree of the 
items. In a further aspect of the present invention, the extracted information may 
include pharmaceutical information. In another aspect of the present invention, the 
knowledge sources may include one or more an internal knowledge sources, and/or 
an external knowledge sources. In yet still another aspect of the present invention, 

25 the network may be capable of communicating using TCP/IP protocol. 

In an embodiment of the present invention, a user may be permitted to access the 
database utilizing the network to retrieve the stored index. In another embodiment 
of the present invention, a query may be received utilizing the network whereupon, 
30 the index may be searched for information matching the query to thereby permit 
retrieval of the matching information utilizing the network. 



28 



In one embodiment of the present invention, the knowledgebase framework 200 may 
be used to help a user learn about a field and/or catch up on new developments in 
this field, hi an embodiment of the present invention, a user may be able to use the 

5 knowledgebase fi-amework 200 to find people who are involved in the area being 
studied and their background, previous research work done in the area (which in an 
illustrative embodiment may include a hst of targets, compounds and drugs), and 
obtain research reports relating to the area being studied. Also, the user may utihze 
the knowledgebase fi-amework 200 to find information fi-om external sources such 

10 as, for example: recent patents, targets, compoimds, and drugs relating to the area 
being studied, as well the people (such as scientists) who are actively working in this 
field or area of study. 

Figure 7 is a schematic illustration of a log in frame 700 for permitting a user to log 
15 into the knowledgebase framework utilizing a network in accordance with an 

embodiment of the present invention. In one aspect of the present invention, the log 
in frame 700 may include a user name field 702 for permitting a user to enter a user 
name associated therewith, a password field 704 for permitting the user to enter a 
password associated with the user and the user name, and a selectable log in button 
20 706 for permitting the user to log into the knowledgebase framework upon selection 
thereof. 

Upon logging in, the user has access to the knowledgebase framework utilizing the 
decision support application 216 to obtain information in the area of their study. 

25 Figure 8 is a schematic illustration of a search and browse frame 800 of the decision 
support application for permitting a user browse the knowledge model-based index 
in accordance with an embodiment of the present invention. Displayed in the search 
and browse frame 800 is a portion of the items of an illusfrative knowledge model- 
based index, in this illusfrative example, the items of the index being displayed 

30 comprises the therapeutic area items 802 of the illustrative index including, for 

example, a cardiology/vascular diseases 804 therapeutic area item. The items of the 
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index displayed in the search and browse frame 800 also comprise selectable links 
which upon selection thereof by the user leads to the retrieval from the index 
database 210 and the displaying in the search and browse frame 800 of items of the 
knowledge model-based index related to the selection item. 

5 

Figure 9 is a schematic illusfration of the items of the knowledge model-base index 
displayed in the search and browse frame 800 upon selection of the 
cardiology/vascular diseases item 804 in Figure 8 in accordance with an 
embodiment of the present invention. These items also comprise selectable links so 
1 0 that selection of one of them leads to the retrieving and display of other items of the 
index related to the selected item. Included in Figure 9, are disease items 900 and 
biological target items 902 of the illustrative index related to the selected therapeutic 
target item 804. hi an illustrative example, one of the biological target items of the 
index being displayed in Figure 9 may include an ACE biological target item 904. 

15 

Figure 10 is a schematic illustration of the items of the knowledge model-base index 
displayed in the search and browse frame 800 upon selection of the ACE biological 
target item 904 in Figure 9 in accordance with an embodiment of the present 
invention. These items also comprise selectable links so that selection of one of 
20 them leads to the retrieving and display of other items of the index related to the 
selected item, hicluded in Figure 10, are component items 1002 of the illusfrative 
index related to the selected item. Li an illustrative example, one of the compound 
items of the index being displayed in Figure 10 may include a captopril compound 
item 1004. 

25 

As illustrated in Figure 10, in one aspect of the present invention, the search and 
browse frame 800 may include a display bar 1006 in which the hierarchy 1008 in the 
index of the selected item may be displayed. For example, in Figure 10, the 
displayed hierarchy 1008 may comprises the target name ACE, and the therapeutic 
30 area cardiology/vascular. 
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Figure 11 is a schematic illustration of illustrative items of the knowledge model- 
base index displayed in the search and browse frame 800 upon selection of the 
captopril compound item 1004 in accordance with an embodiment of the present 
invention. These items also comprise selectable links so that selection of one of 

5 them leads to the retrieving of information from the knowledgebase framework 200 
and/or other items of the index related to the selected item. Included in Figure 11, 
are illustrative items of the illustrative index related to the selected item (in this case 
captopril). In this illustrative example, the displayed related items include: related 
drug items 1100, related patent items 1102, related scientists and/or people items 

10 1104, related literature items 1106, and related disease items 1108. 

In one embodiment of the present invention, one of the items may be selected (such 
as by clicking the right button of a mouse when the mouse pointer is over the item, 
i.e., "right clicking") to display a pop-up menu 1110 which includes a monitor 
15 selection 1112 and a visit source selection 1114. 

Figure 12 is a schematic illustration of a browser frame 1200 which displays 
information 1202 from knowledge source relating to an item (in this example the 
captopril drug item 1004) upon selection of the visit source selection 1114 of Figure 
20 11 in accordance with an embodiment of the present invention. In the illustrative 
example illusfrated in Figure 12, the information 1202 displayed in the browser 
frame 1200 relates to the drug captopril and is obtained from an external source, in 
this case an illustrative website 1204 on the Internet 

25 Figure 13 is a schematic illusfration of an exemplary display of the display bar 1006 
upon selection of a scientist/person item 1104 in accordance with an embodiment of 
the present invention. When a user selects a scientist/person item 1104, the 
knowledgebase framework may retrieve contact information for the scientist/person 
from a knowledge source and then display the retrieved contact information 1300 in 

30 the display bar 1006. 
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Figure 14 is a schematic illustration of illustrative items of the knowledge model- 
base index displayed in the search and browse frame 800 upon selection of the 
patent item 1116 of Figure 11 in accordance with an embodiment of the present 
invention. In an illustrative embodiment, the displayed items related to the selected 
5 patent item 1116 may include items for patent cited 1400 in the selected patent 1116, 
items for organizations 1402 related to the selected patent, items for FDA approval 
matters 1404 related to the selected patent, items for scientists and/or people 1406 
associated with the selected patent, and items for compoimds 1004 related to the 
selected patent. 

10 

Figure 15 is a schematic illustration of a browser frame 1500 which displays 
information 1502 from knowledge source relating to the selected patent item of 
Figure 14 upon selection of the visit source selection 1114 of Figiure 14 in 
accordance with an embodiment of the present invention. In the illustrative example 
1 5 illustrated in Figure 15, the information 1502 displayed in the browser frame 1500 
relates to US Patent 5,238,924 and is obtained from an external source, in this case, 
the USPTO website's Patent Full Text and Image Database. 

Utilizing the knowledgebase framework, a user may be able to monitor work done 
20 by others, such as scientists researching a particular area or field. This may be 

accomplished by selecting the monitor selection 1110 of a selected item, such as for 
example a scientist item 1118 displayed the search and browse frame of Figure 11. 
In an embodiment of the present invention, any of the items of the index can be 
monitored for changes and updates to related information. For example, a user can 
25 select for monitoring an organization for any new publications, patents, drug 
applications, and recent news associated with the organization. In another 
illusfrative example, a target item may be monitored for any new information related 
to monitored target. 

In one embodiment of the present invention, when the user logs into the decision 
30 support application 216, the user may see the most recent news about the scientists. 
Figure 16 illustrates a recent news frame 1600 that the user may access after logging 
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on to the network in accordance with an embodiment of the present invention. In 
one aspect of the present invention, a portion of the recent news frame 1600 may 
comprise a recent news portion 1602 and a monitored items portion 1604. Li such 
an aspect, hnks 1606, 1608 to recently updated information may be displayed. In an 
5 illustrative example, selection of link 1606 (which is a link to a patent) may result in 
the retrieval and displaying of a browser frame similar to the browser frame of 
Figure 15 to display information relating to the selected patent obtained from the 
USPTO website's Patent Full Text and Image Database. 

10 With continuing reference to Figure 16, the links may also comprise hypertext links 
and include textual summaries relating to information that was recently updated (see 
e.g., link 1608). As an option, these links may be organized by subjects or items 
such as for example, the items of the knowledge model 208 and displayed in 
corresponding subheadings. As illusfrated in Figure 16, some illustrative 

15 subheadings may include an organizations subheading 1610, a targets subheading 
1612, and a scientist subheading 1614. 

The monitored items portion 1604 may display a list of items 1616 selected by the 
user to be monitored by the knowledgebase framework. Like the recent news links 
20 1 606, the items 1616 in the monitored items portion 1604 may comprise links to 
access items in the knowledge model-based index. 

With continuing reference to Figure 16, a search bar 1618 may also be displayed to 
the user. The search bar 1618 may include at least one field 1620 for the user to 
25 input a search term. Upon input, a search may be conducted by a search engine for 
items in the index related to the inputted term(s). Figure A 

Figure 17 illustrates a flowchart for a process 1700 for monitoring information in a 
knowledgebase framework in accordance with an embodiment of the present 
invention. UtiUzing a network, information is received from a user that relates to a 
30 target to be tracked in operation 1702. The network is monitored for changes or 
updates to the target 1704. When a change is monitored, data relating to the 
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monitored changes is retrieved in operation 1706 and then transmitted to the user 
utilizing the network in operation 1708. 

hi one aspect of the present invention, the target may be an item of index displayed 
5 to the user utilizing the network, hi another aspect of the present invention, the 
target maybe: a publication (e.g., literature), a person (e.g., scientist) ,a therapeutic 
area, a disease, a biological target, an organization, a compound, a patent, FDA 
approval, and/or a drug. 

10 hi a further aspect of the present invention, a pharmaceutical database may be 

monitored for changes or updates relating to the target, hi yet another aspect of the 
present invention, the network may comprise an intranet of an organization and the 
hitemet. 

15 hi an embodiment of the present invention, the received information may be stored 
in memory. In another embodiment of the present invention, the retrieved data may 
be transmitted to the user after receipt of an indication that the user has logged on to 
the network. As an option to such an embodiment, the retrieved data may be 
automatically transmitted to the user after receipt of the indication that the user has 

20 logged on to the network. 

In a further embodiment of the present invention, the user may be alerted that a 
change or update to the target has been monitored utilizing the network. In even 
another embodiment of the present invention, the user may be permitted to input a 
25 search term utilizing the network. In such an embodiment, items associated with the 
search term may be searched for upon receipt of the search term. Then those items 
which have been found to be associated with the inputted search term may be 
displayed to the user utihzing the network. 

30 Figure 18A is a schematic illustration of a process map 1800 displayed in a research 
frame 1802 for use in a knowledgebase framework in accordance with an 
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embodiment of the present invention. With the research frame 1802, a user may be 
able to share a set of common processes and report templates with one or more other 
users to document their findings for further collaboration. The process map 
graphically depicts a timeline for one or more processes. In the illustrative example 
5 illustrated in Figure 18 A, the depicted process map 1800 may be for process for the 
development of a product 1804 in a pharmaceutical setting. 

The research frame 1802 may also include selectable links for accessing various 
tools for the research frame such as for example, templates 1806 and target tracking 
10 tools 1808. Figiu-e 18B is a schematic illustration of an illustrative template 1810 
for a process map displayed in the research frame 1802 in accordance with an 
embodiment of the present invention. 

Figure 18C illusfrates an inbox 1812 for managing email messages displayed in an 
15 organizer frame 1814 for use in a knowledgebase framework in accordance with an 
embodiment of the present invention. The organizer frame may also include a 
plurality of tool links for accessing features of the organizer frame. Illustrative 
examples of tool links include a mail link 1816 for displaying the inbox upon 
selection thereof and a calendar link 1818. Also illustrated in Figure 18C is a puU- 
20 down menu 1820 that may be included in the search bar 1618 to further narrow a 
search of terms input into the search bar. 

Figure 18D illusfrates a calendar 1822 for managing events and appointments of a 
user displayed in an organizer frame 1814 for use in a knowledgebase framework in 
25 accordance with an embodiment of the present invention. The calendar 1822 may 
be displayed upon selection of the calendar link 1818 by a user. 

The following example describes an illustrative scenario for utilizing the 
knowledgebase framework in accordance with an embodiment of the present 
30 invention. 
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EXAMPLE 

Anne Kline, a senior biologist at Acme Pharmaceutical, has just transferred 
from the Oncology department to the Cardiovascular department. She has a 
5 reasonably strong background in Cardiovascular. Prior to joining Acme 

Pharmaceutical, she worked at the Imperial College School of Medicine's 
Cardiovascular department for a couple years. However, she has not been 
active in this area since she joined Acme Pharmaceutical 3 years ago. She 
needs to catch up with the new developments in this area ~ inside and 
] 0 outside Acme Pharmaceutical. Acme Pharmaceutical has just installed a 

knowledgebase framework. The knowledgebase framework allows Acme 
Pharmaceutical's scientists to search, browse and monitor internal and 
external information available to them. Anne accesses the knowledgebase 
framework from her computer desktop. 
15 Anne accesses the knowledgebase framework from her computer desktop. 

She spends almost the entire day using the knowledgebase framework and at 
the end of the day she is able to find: 

-The people in Acme Pharmaceutical who are involved in the 

cardiovascular area and their background 
20 —Previous research work done within Acme Pharmaceutical (which 

includes a list of targets, compounds and drugs) 

-Internal research reports 
In addition, Anne also finds usefiil information from external sources such as 
recent: 

25 -Patents 
-Targets 
—Compounds 
-Drugs 

-as well the scientists who are actively working in this area 
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In addition, Anne finds two scientists whose work seem to be relevant to her 
first assignment. She sets up her profile in the knowledgebase framework in 
such a way that it will monitor any future work done by these scientists ... 
The next time Anne accesses the knowledgebase framework, she will see the 
5 most recent news about those two scientists. She also knows that Merck has 

been very active in the Cardiovascular area. She sets up the knowledgebase 
firamework to monitor any new pubhcations, patents, drug applications by 
Merck The next time Anne accesses the knowledgebase framework, she will 
see the most recent news about Merck. 

10 

Anne's first assignment is to investigate TR27 K-Channel as a potential 
target for hypertension treatment. She uses the knowledgebase framework to 
find out any previous work related TR27. She finds only one article that are 
somewhat relevant. Since she will be working on this target for awhile, she 
15 sets up the knowledgebase framework to monitor any new information 

related to TR27. One morning a couple days later, Arme turns on her 
computer and the knowledgebase framework informs her that Pfizer has filed 
a patent and this patent has cited TR27. Anne quickly browses through the 
patent. Luckily, the patent cited TR27 for a different reason. 

20 

Later on that day, the knowledgebase framework informs her that there is a 
newly released internal report that mentioned this particular target. This 
report was filed by the Newology department, right after the High 
Throughput Screening was conducted on the target. She downloads the 

25 report and studies it carefully. 

She launches Target DB, a tool that stores information of all targets 
investigated by Acme Pharmaceutical, from the knowledgebase framework 
to find out the details information about assay used for TR27. With help 
from the knowledgebase framework, Aime figures out the person involved 

30 with this target. She is able to contact one of that researcher for further 

information. 
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While her testing procedures will be different, Anne is able to use many parts 
of the results as a starting point. This encounter has saved her a few months 
of hard work. The two researchers are able to share a set of common 
5 processes and report templates to document their findings for further 

collaboration. 

Figure 19 is a schematic diagram of an exemplary system architecture in accordance 
with an embodiment of the present invention. 

10 

In accordance with an embodiment of the present invention, a BackgrovindFinder 
(BF) is implemented as an agent responsible for preparing an individual for an 
upcoming meeting by helping him/her retrieve relevant information about the 
meeting from various sources. BF receives input text in character form indicative of 

15 the target meeting. The input text is generated in accordance with an embodiment of 
the present invention by a calendar program that includes the time of the meeting. 
As the time of the meeting approaches, the calendar program is queried to obtain the 
text of the target event and that information is utilized as input to the agent. Then, 
the agent parses the input meeting text to extract its various components such as 

20 title, body, participants, location, time etc. The system also performs pattern 

matching to identify particular meeting fields in a meeting text. This information is 
utilized to query various sources of information on the web and obtain relevant 
stories about the current meeting to send back to the calendaring system. For 
example, if an individual has a meeting with Netscape and Microsoft to talk about 

25 their disputes, and would obtain this initial information from the calendaring system. 
It will then parse out the text to realize that the companies in the meeting are 
"Netscape" and "Microsoft" and the topic is "disputes." Then, the system queries 
the web for relevant information concerning the topic. Thus, in accordance with an 
objective of the invention, the system updates the calendaring system and eventually 

30 the user with the best information it can gather to prepare the user for the target 
meeting. In accordance with an embodiment of the present invention, the 
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information is stored in a file that is obtained via selection from a link imbedded in 
the calendar system. 

Program Organization: 

5 A computer program in accordance with an embodiment of the present invention is 
organized in five distinct modules: BF.Main, BF.Parse, Background Finder .Error, 
BF.PattemMatching and BF.Search. There is also a frmMain which provides a user 
interface used only for debugging purposes. The executable programs in accordance 
with an embodiment of the present invention never execute with the user interface 

10 and should only return to the calendaring system through Microsoft's Winsock 

control. An embodiment of the system executes in two different modes which can 
be specified under the command line sent to it by the calendaring system. When the 
system runs in simple mode, it executes a keyword query to submit to external 
search engines. When executed in complex mode, the system performs pattern 

15 matching before it forms a query to be sent to a search engine. 

Data Structures: 

The system in accordance with an embodiment of the present invention utilizes three 
user defined structures: 
20 TMeetingRecord; 

TPattemElement; and 
TPattemRecord. 

The user-defined structure, tMeetingRecord, is used to store all the pertinent 
25 information conceming a single meeting. This info includes userlD, an original 

description of the meeting, the extracted hst of keywords from the title and body of 
meeting etc. It is important to note that only one meeting record is created per 
instance of the system in accordance with an embodiment of the present invention. 
This is because each time the system is spawned to service an upcoming meeting, it 
30 is assigned a task to retrieve information for only one meeting. Therefore, the 
meeting record created corresponds to the current meeting examined. 
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ParseMeetingText populates this meeting record and it is then passed around to 
provide information about the meeting to other functions. 
If GoPattemMatch can bind any values to a particular meeting field, the 
corresponding entries in the meeting record is also updated. The structure of 
5 tMeetingRecord with each field described in parentheses is provided below in 
accordance with an embodiment of the present invention. 



Public Type tMeetingRecord 

sUserlD As String (user id given by Munin) 

sTitleOrig As String (original non stop listed title we need to keep aroimd to 

send back to Munin) 
sTitleKW As String (stoplisted title with only keywords) 
sBodyKW As String (stoplisted body with only keywords) 
sCompanyO As String (companies identified in title or body through pattern 

matching) 

sTopicO As String (topics identified in title or body through pattern 
matching) 

sPeopleO As String (people identified in title or body through pattern 
matching) 

sWhen() As String (time identified in title or body through pattern 
matching) 

sWhereO As String (location identified in title or body through pattern 
matching) 

sLocation As String (location as passed in by Munin) 
sTime As String (time as passed in by Munin) 

sParticipantsO As String (all participants engaged as passed in by Munin) 
sMeetingText As String (the original meeting text w/o userid) 
End Type 



30 



There are two other structures which are created to hold each individual pattern 
utilized in pattern matching. The record tAPattemRecord is an array containing all 
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the components / elements of a pattern. The type tAPattemElement is an array of 
strings which represent an element in a pattern. Because there may be many 
"substitutes" for each element, we need an array of strings to keep track of what all 
the substitutes are. The structures of tAPattemElement and tAPattemRecord are 
5 presented below in accordance with an embodiment of the present invention. 

Public Type tAPattemElement 

elementArrayO As String 
End Type 
10 Public Type tAPattemRecord 

pattemArrayO As tAPattemElement 
End Type 

User Defined Constants: 

1 5 Many constants are defined in each declaration section of the program which may 
need to be updated periodically as part of the process of maintaining the system in 
accordance with an embodiment of the present invention. The constants are 
accessible to allow dynamic configuration of the system to occur as updates for 
maintaining the code. 

20 

hicluded in the following tables are lists of constants fi-om each module which I 
thought are most likely to be modified from time to time. However, there are also 
other constants used in the code not included in the following list. It does not mean 
that these non-included constants will never be changed. It means that they will 
25 change much less frequently. 



For the Main Module (BF.Main) : 



CONSTANT 


PRESET VALUE 


USE 


MSGTOMUNIN_TYPE 


6 


Define the message number used t 
identify messages between BF anc 
Munin 
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CONSTANT 


PRESET VALUE 


USE 


IP_ADDRESS_MUNIN 


"10.2.100.48" 


Define the IP address of the machii 
in which Munin and BF are nmnin 
on so they can transfer data throug 
UDP. 


PORT_MUNIN 


1111 


Define the remote port in which 
we are operating on. 


TIMEOUT_AV 


60 


Define constants for setting time oi 
in inet controls 


TIMEOUT NP 


60 


Define constants for setting time oi 
in inet controls 


CMD_SEPARATOR 


"\" 


Define delimiter to tell which part ( 
Munin's command represents the 
beginning of our input meeting tex 


OUTPARAM_SEPARAT0 




Define delimiter for separating ou 
different portions of the output. Tl 

separator is for delimiting the ms^ 
type, the user id, the meeting title ai 

the begiiming of the actual stories 
retrieved. 



For the Search Module (BF. Search): 



CONSTANT 


CURRENT VALU 


USE 


PAST_NDAYS 


5 


Define number of days you want t< 
look back for AltaVista articles. 
Doesn't really matter now because \ 
aren't really doing a news search ii 
alta vista. We want all info. 


CONNECTOR_AV_URL 


"+AND+" 


Define how to connect keywords. V 
want all our keywords in the string 
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CONSTANT 


CURRENT VALU 


USE 






for now use AND. If you 
want to do an OR or something, jui 
change connector. 


CONNECTOR NP URL 


"+AND+" 


Define how to connect keywords. ^ 
want all our keywords in the string 

for now use AND. If you 
want to do an OR or something, ju! 
change connector. 


NUM_NP_STORIES 


3 


Define the number of stories to retu 
back to Mimin from NewsPage. 


NUM_AV_STORIES 


3 


Define the number of stories to retu 
back to Munin from AltaVista. 



For the Parse Module (BF.Parse): 



CONSTANT 


CURRENT VALU 


USE 


PORTION_SEPARATOR 




Define the separator between diffen 
portions of the meeting text sent in 
Munin. For example in "09::Meet w 
Chad::about life::Chad j Denise 
"::" is the separator between differe 
parts of the meeting text. 


PARTICIPANT_SEPARATOR 




Define the separator between eacl 
participant in the participant list 
portion of the original meeting tex 
Refer to example above. 



For Pattern Matching Module (BFPattemMatch): There are no constants in this 
module which require frequent updates. 
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General Process Flow: 

The best way to depict the process flow and the coordination of functions between each 
other is with the five flowcharts illustrated in Figures 20 to 24. Figure 20 depicts the 
overall process flow in accordance with an embodiment of the present invention. 

5 Processing commences at the top of the chart at function block 2000 which launches 
when the program starts. Once the application is started, the command line is parsed to 
remove the appropriate meeting text to initiate the target of the background find 
operation in accordance with an embodiment of the present invention as shown in 
function block 2010. A global stop list is generated after the target is determined as 

10 shown in function block 2020. Then, all the patterns that are utilized for matching 

operations are generated as illustrated in function block 2030. Then, by tracing through 
the chart, function block 2000 invokes GoBF 2040 which is responsible for logical 
processing associated with wrapping the correct search query information for the 
particular target search engine. For example, function block 2040 flows to function 

15 block 2050 and it then calls GoPattemMatch as shown in function block 2060. To see 
the process flow of GoPattemMatch, we swap to the diagram titled "Process Flow for 
BF's Pattern Matching Unit." 

One key thing to notice is that functions depicted at the same level of the chart are called 
20 by in sequential order jSrom left to right (or top to bottom) by their common parent 
function. For example, Main 2000 calls ProcessCommandLine 2010, then 
CreateStopListist 2020, then CreatePattems 2030, then GoBackgroundFinder 2040. 
Figures 21 to 24 detail the logic for the entire program, the parsing unit, the pattern 
matching unit and the search unit respectively. Figure 24 details the logic determinative 
25 of data flow of key information through BackgroimdFinder, and shows the functions 
that are responsible for creating or processing such information. 

DETAILED SEARCH ARCHITECTURE UNDER THE BASIC SEARCH / 
30 SIMPLE QUERY MODE 



44 



Search ALTA VISTA (Function block 2070 of Figure 20): 

The Alta Vista search engine utilizes the identifies and returns general information 
about topics related to the current meeting as shown in function block 270 of Figure 
2. The system in accordance with an embodiment of the present invention takes all 

5 the keywords from the title portion of the original meeting text and constructs an 
advanced query to send to Alta Vista. The keywords are logically combined 
together in the query. The results are also ranked based on the same set of 
keywords. One of ordinary skill in the art will readily comprehend that a date 
restriction or publisher criteria could be facilitated on the articles we want to 

10 retrieve. A set of top ranking stories are returned to the calendaring system in 
accordance with an embodiment of the present invention. 



NewsPage (Function Block 2075 of Figure 20): 

The NewsPage search system is responsible for giving us the latest news topics 
15 related to a target meeting. The system takes all of the keywords from the title 

portion of the original meeting text and constructs a query to send to the NewsPage 
search engine. The keywords are logically combined together in the query. Only 
articles pubhshed recently are retrieved. The NewsPage search system provides a 
date restriction criteria that is settable by a user according to the user's preference. 
20 The top ranking stories are returned to the calendaring system. 

Figure 21 is a user profile data model in accordance with an embodiment of the 
present invention. Processing commences at ftinction block 2100 which is 
responsible for invoking the program from the main module. Then, at function 

25 block 2110, a wrapper fimction is invoked to prepare for the keyword extraction 
processing in fimction block 2120. After the keywords are exfracted, then 
processing flows to fimction block 2130 to determine if the delimiters are properly 
positioned. Then, at function block 2140, the number of words in a particular string 
is calculated and the delimiters for the particular field are and a particular field from 

30 the meeting text is retrieved at fimction block 2150. Then, at fimction block 2180, 
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the delimiters of the string are again checked to assure they are placed appropriately. 
Finally, at function block 2160, the extraction of each word from the title and body 
of the message is performed a word at a time utilizing the logic in function block 
2162 which finds the next closest word delimiter in the input phrase, function block 
5 2164 which strips unnecessary materials from a word and function block 2166 

which determines if a word is on the stop list and returns an error if the word is on 
the stop Ust. 



Pattern Matching: 
Limitations associated with a simple searching method include: 

1 . Because it relies on a stop list of unwanted words in order to extract from the 

meeting text a set of keywords, it is limited by how comprehensive the stop 
list is. Listead of trying to figure out what parts of the meeting text we 
should throw away, we should focus on what parts of the meeting text we 
want. 

2. A simple search method in accordance with an embodiment of the present 

invention only uses the keywords from a meeting title to form queries to 
send to Alta Vista and NewsPage. This ignores an alternative source of 
information for the query, the body of the meeting notice. We cannot 
include the keywords from the meeting body to form our queries because this 
often results in queries which are too long and so complex that we often 
obtain no meaningful results. 

3. There is no way for us to tell what each keyword represents. For example, we 

may exfract "Andy" and "Grove" as two keywords. However, a simplistic 
search has no way knowing that "Andy Grove" is in fact a person's name. 
Imagine the possibilities if we could somehow intelligently guess that "Andy 
Grove" is a person's name. We can find out if he is an Andersen person and 
if so what kind of projects he's been on before etc. etc. 

4. In summary, by rel3dng solely on a stop list to parse out unnecessary words, we 

suffer from "information overload". 
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Pattern Matching Overcomes These Limitations: 

Here's how the pattern matching system can address each of the corresponding 
issues above in accordance with an embodiment of the present invention. 
5 1 . By doing pattern matching, we match up only parts of the meeting text that we 

want and extract those parts. 
2. By performing pattern matching on the meeting body and extracting only the 

parts from the meeting body that we want. Our meeting body will not go to 

complete waste then. 

10 3. Pattern matching is based on a set of templates that we specify, allowing us to 
identify people names, company names etc from a meeting text. 
4. Li summary, with pattern matching, we no longer suffer from information 
overload. Of course, the big problem is how well our pattem matching 
works. If we rely exclusively on artificial intelligence processing, we do not 

15 have a 100% hit rate. We are able to identify about 20% of all company 

names presented to us. 

Patterns: 

A pattem in the context of an embodiment of the present invention is a template 
20 specifying the structure of a phrase we are looking for in a meeting text. The 

patterns supported by an embodiment of the present invention are selected because 
they are templates of phrases which have a high probability of appearing in 
someone's meeting text. For example, when entering a meeting in a calendar, many 
would write something such as "Meet with Bob Dutton from Stanford University 
25 next Tuesday." A common pattem would then be something like the word "with" 
followed by a person's name (in this example it is Bob Dutton) followed by the 
word "from" and ending with an organization's name (in this case, it is Stanford 
University). 

30 Pattern Matching Terminology: 

Terminology associated with pattem matching includes: 
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• Pattern: a pattern is a template specifying the structure of a phrase we want 
to bind the meeting text to. It contains sub units. 

• Element: a pattern can contain many sub-units. These subunits are called 
elements. For example, in the pattern "with $PEOPLE$ from 

5 $COMPANY$", "with" "$PEOPLE$" "from" "$COMPANY$" are all 

elements. 

• Placeholder: a placeholder is a special kind of element in which we want to 
bind a value to. Using the above example, "$PEOPLE$" is a placeholder. 

• Indicator: an indicator is another kind of element which we want to find in a 
1 0 meeting text but no value needs to bind to it. There may be often more than 

one indicator we are looking for in a certain pattern. That is why an indicator 
is not an "atomic" type. 

• Substitute: substitutes are a set of indicators which are all synonyms of each 
other. Finding any one of them in the input is good. 

15 

There may be five fields which are identified for each meeting: 

Company ($COMPANY$) 

• People ($PEOPLE$) 

• Location ($LOCATION$) 
20 • Time ($TIME$) 

• Topic ($TOPIC_UPPER$) or ($TOPIC_ALL$) 

hi parentheses are the illusfrative placeholders used in the code as representation of 
the corresponding meeting fields. 

25 Each placeholder may have the following meaning: 

• $COMPANY$: binds a string of capitalized words (e.g., Meet with Joe 
Carter of <Andersen Consulting >) 

• $PEOPLE$: binds series of string of two capitahzed words potentially 
connected by "and" or "&" (e.g.. Meet with <Joe Carter> of Andersen 

30 Consulting, Meet with <Joe Carter and Luke Hughes> of Andersen 

Consulting) 
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• $LOCATION$: binds a string of capitalized words (e.g.. Meet Susan at 
<Palo Alto Square>) 

• $TIME$: binds a string containing the format #:## (e.g.. Dinner at <6:30 
pm>) 

5 • $TOPIC_UPPER$: binds a string of capitalized words for our topic (e.g., 
<Stanford Engineering Recruiting> Meeting to talk about new hires). 

• $TOPIC_ALL$: binds a string of words without really caring if it's 
capitahzed or not. (e.g.. Meet to talk about <ubiquitous computing>) 



10 The following table represents patterns supported by BF. Each pattern belongs to a 
pattern group. All patterns within a pattern group share a similar format and they 
only differ from each other in terms of what indicators are used as substitutes. Note 
that the patterns which are grayed out are also commented in the code. BF has the 
capability to support these patterns but we decided that matching these patterns is 

1 5 not essential at this point. 



1 


1.1.1.1.1.: 


$PEOPLE$ of 
$COMPANY$ 


Paul Maritz of Microsoft 




b 


$PEOPLE$ from 
$COMPANY$ 


Bill Gates, Paul Allen and 
Paul Maritz from Microsoft 


2 


a 


$TOPIC_UPPER$ meeting 


Push Technology Meeting 




b 


$TOPIC_lJPPER$ mtg 


Push Technology Mtg 




c 


$TOPIC_UPPERS demo 


Push Technology demo 




d 


$TOPIC_UPPER$ 
interview 


Push Technology interview 




e 


$TOPIC_UPPER$ 


Push Technology 
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presentation 


presentation 




f 


$TOPIC_UPPER$ visit 


Push Technology visit 




g 


$TOPIC_UPPER$ briefing 


Push Technology briefing 




h 


$TOPIC_UPPER$ 
discussion 


Push Technology 
discussion 






$TOPIC_UPPER$ 
workshop 


Push Technology 
workshop 




j 


$TOPIC_UPPER$ prep 


Push Technology prep 




k 


$TOPIC_UPPER$ review 


Push Technology review 




1 


$TOPIC_UPPER$ lunch 


Push Technology lunch 




m 


$TOPIC_UPPER$ project 


Push Technology project 




n 


$TOPIC_UPPER$ projects 


Push Technology projects 


3 


a 


$COMPANY$ corporation 


Intel Corporation 




b 


$COMPANY$ corp. 


IBM Corp. 




c 


$COMPANY$ systems 


Cisco Systems 




d 


$COMPANY$ limited 


IBM limited 






$COMPANY$ ltd 


IBM ltd 


4 


a 


about $TOPIC_ALL$ 


About intelligent agents 
technology 




b 


discuss $TOPIC ALL$ 


Discuss intelligent agents 
technology 






show$TOPIC ALLS 


Show the client our 
intelligent agents 
technology 




d 


re: $TOPIC_ALL$ 


re: intelligent agents 
technology 




e 


review $TOPIC ALLS 


Review intelligent agents 
technology 




f 


agenda 


The agenda is as follows: 
—clean up 
—clean up 
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—clean up 




g 


agenda: $TOPIC_ALL$ 


Agenda: 

—demo client intelligent 
agents technology, 
—demo ecommerce. 


5 


a 


w/$PEOPLE$ of 
$COMPANY$ 


Meet w/Joe Carter of 
Andersen Consulting 




b 


w/$PEOPLE$ from 
SCOMPANYS 


Meet w/Joe Carter from 
Andersen Consulting 


6 


a 


w/$COMPANY$ per 
$PEOPLE$ 


TaUc w/Intel per Jason 
Foster 


■7 


a 


At STIMES- 


at 3:0C)pm 




b 


Around ST1ME$ 


Around 3:00 pm 


8 


a 


At $LOCATION$ 


At LuLu's resturant 




b 


In $LOCATION$ 


in Santa Clara 


9 




Per $PEOPLE$ 


per Susan Butler 


10 


a 


call w/$PEOPLE$ 


Conf call w/John Smith 




B 


call with $PEOPLE$ 


Conf call with John Smith 


11 


A 


prep for $TOPIC_ALL$ 


Prep for London meeting 




B 


preparation for 
$TOPIC_ALL$ 


Preparation for London 
meeting 



Figure 22 is a detailed flowchart of pattern matching in accordance with an 
embodiment of the present invention. Processing commences at function block 

5 2200 where the main program invokes the pattern matching application and passes 
control to fimction block 2210 to commence the pattern match processing. Then, at 
ftmction block 2220, the wrapper function loops through to process each pattern 
which includes determining if a part of the text string can be bound to a pattern as 
shown in ftmction block 2230. Then, at function block 2240, various placeholders 

10 are bound to values if they exist, and in function block 2241, a list of names 
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separated by punctuation are bound, and at function block 2242 a full name is 
processed by finding two capitalized words as a full name and grabbing the next 
letter after a space after a word to determine if it is capitalized. Then, at function 
block 2243, time is parsed out of the string in an appropriate manner and the next 

5 word after a blank space in function block 2244. Then, at function block 2245, the 
continuous phrases of capitahzed words such as company, topic or location are 
bound and in function block 2246, the next word after the blank is obtained for 
further processing in accordance with an embodiment of the present invention. 
Following the match meeting field processing, function block 2250 is utilized to 

1 0 locate an indicator which is the head of a pattern, the next word after the blank is 
obtained as shown in function block 2252 and the word is checked to determine if 
the word is an indicator as shown in function block 2254. Then, at function block 
2260, the string is parsed to locate an indicator which is not at the end of the pattern 
and the next word after imnecessary white space such as that following a line feed or 

15 a carriage return is processed as shown in function block 2262 and the word is 

analyzed to determine if it is an indicator as shown in function block 2264. Then, in 
function block 2270, the temporary record is reset to the null set to prepare it for 
processing the next string and at function block 2280, the meeting record is updated 
and at function block 2282 a check is performed to determine if an entry is already 

20 made to the meeting record before parsing the meeting record again. 

Using the Identified Meeting Fields: 

Now that we have identified fields within the meeting text which we consider 
important, there are quite a few things we can do with it. One of the most important 

25 applications of pattern matching is of course to improve the query we construct 

which eventually gets submitted to Alta Vista and News Page. There are also a lot of 
other options and enhancements which exploit the results of pattern matching that 
we can add to BF. These other options will be described in the next section. The 
goal of this section is to give the reader a good sense of how the results obtained 

30 fi-om pattern matching can be used to help us obtain better search results. 
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Figure 23 shows a flowchart of the detailed processing for preparing a query and 
obtaining information from the hitemet in accordance with an embodiment of the 
present invention. Processing commences at function block 2300 and immediately 
flows to function block 2310 to process the wrapper functionahty to prepare for an 

5 Intemet search utilizing a web search engine. If the search is to utilize the Alta Vista 
search engine, then at function block 2330, the system takes information from the 
meeting record and forms a query in function blocks 2340 to 2360 for submittal to 
the search engine. If the search is to utilize the NewsPage search engine, then at 
function block 2320, the system takes information from the meeting record and 

10 forms a query in function blocks 2321 to 2328. 

Alta Vista Search Engine: 

A strength of the Alta Vista search engine is that it provides enhanced flexibility. 
Using its advance query method, one can construct all sorts of Boolean queries and 

1 5 rank the search however you want. However, one of the biggest drawbacks with 
Alta Vista is that it is not very good at handhng a large query and is hkely to give 
back irrelevant results. If we can identify the topic and the company within a 
meeting text, we can form a pretty short but comprehensive query which will 
hopefully yield better results. We also want to focus on the topics found. It may not 

20 be of much merit to the user to find out info about a company especially if the user 
already knows the company well and has had numerous meetings with them. It's 
the topics they want to research on. 

News Page Search Engine: 

25 A strength of the News Page search engine is that it does a great job searching for 
the most recent news if you are able to give it a valid company name. Therefore 
when we submit a query to the news page web site, we send whatever company 
name we can identify and only if we cannot find one do we use the topics found to 
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form a query. If neither one is found, then no search is performed. The algorithm 
utilized to form the query to submit to Alta Vista is illustrated in Figure 25. The 
algorithm that we will use to form the query to submit to News Page is illustrated in 
Figure 26. 

5 

The following table describes in detail each function in accordance with an embodiment 
of the present invention. The order in which fimctions appear mimics the process flow 
as closely as possible. When there are situations in which a function is called several 
times, this function will be listed after the first function which calls it and its description 
10 is not duplicated after every subsequent function which calls it. 



Main 


Pubhc 


None 


This is the main function 


(BF.Main) 


Sub 




where the program first 








launches. It initializes BF 








with the appropriate 








parameters(e.g. Intemet 








time-out, stoplist...) and 








calls GoBF to launch the 








main part of the program. 


ProcessCom 


Private 


Main 


This function parses the 


mandLine 


Sub 




command line. It assumes 


(BF.Main) 






that the delimiter indicating 








the begiiming of input from 








Munin is stored in the 








constant 








CMD_SEPARATOR. 


CreateStopLi 


Private 


Main 


This function sets up a stop 


St 


Function 




list for futiu-e use to parse out 


(BF.Main) 






unwanted words from the 
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meeting text. 








There are commas on each 








side of each word to enable 








straight checking. 


CreatePattem 


Public 


Main 


This procedure is called once 


s 


Sub 




when BF is first initialized to 


(BF.Pattem 






create all the potential 


Match) 






patterns that portions of the 








meeting text can bind to. A 








pattern can contain however 








many elements as needed. 








There are 








two types of elements. The 








first type of elements are 








indicators. These are real 








words which delimit the 








potential of a meeting field 








(e.g. company) to follow. 








Most of these indicators are 








stop words as expected 








because 








stop words are words 








usually common to all 








meeting text so it makes 








sense they form patterns. The 








second type of elements are 








special strings which 








represent placeholders. 








A placeholder is always in 








the form of $*$ where * can 
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be either PEOPLE, 








COMPANY, 








TOPIC_UPPER, 








TIME,LOCATION or 








TOPIC_ALL. A pattern can 








begin with either one of the 








two types of elements and 








can be however long. 








involving however any 








number/type of elements. 








This procedure dynamically 








creates a new pattern record 








for 








each pattern in the table and 








it also dynamically creates 








new tAPattemElements for 








each element within a 








pattern. In addition, there is 








the concept of being able to 








substitute indicators within a 








pattern. For example, the 








pattern $PEOPLE$ of 








$COMPANY$ is similar to 








the pattern $PEOPLE$ from 








$COMPANY$. "from" is a 








substitute for "of . Our 








structure should be able to 








express such a need for 








substitution. 


GoBF 


Public 


Main 


This is a wrapper proceduror 
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(BF.Main) 


Sub 




that calls both the parsing 
and the searching 
subroutines of the 
BF. It is also responsible for 
sending data back to Munin. 


ParseMeetin 


Public 


GoBackGroundF 


This function takes the initial 


gText 


Function 


inder 


meeting text and identifies 


(BF.Parse) 






the userED of the record as 








well as other parts of the 








meeting text including the 








title, body, participant Ust, 








location and time. In 








addition, we call a helper 








function ProcessStopList to 








eliminate all the unwanted 








words from the original 








meeting title and meeting 








body so that only keywords 








are left. The information 








parsed out is stored in the 








MeetingRecord structure. 








Note that this function does 








no error checking and for the 








most time assumes that the 








meeting text string is 








correctly formatted by 








Munin. 








The important variable is 








thisMeeting Record is the 








temp holder for all info 
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regarding current meeting. 








It's eventually returned to 








caller. 


ForniatDelim 


Private 


ParseMeetingTe 


There are 4 ways in which 


itation 




xt, 


the delimiters can be placed. 


(BF.Parse) 




DetermineNum 


We take care of all these 






Words, 


cases by reducing them 






GetAWordFrom 


down to Case 4 in which 






String 


there are no delimiters 








around but only between 








fields in a string(e.g. 








A::B::C) 


DetermineNu 


Public 


ParseMeeting 


This functions determines 


mWords 


Function 


Text, 


how many words there are in 


(BF.Parse) 




ProcessStop 


a string (stInEvalString) The 






List 


function assumes that each 








word is separated by a 








designated separator as 








specified in stSeparator. The 








return type is an integer that 








indicates how many words 








have been found assuming 








each word 








in the string is separated by 








StSeparator. This function is 








always used along with 








GetAWordFromString and 








should be called before 








calling GetAWordFrom 








String. 
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GetAWordFr 


Public 


ParseMeeting 


This function extracts the ith 


omString 


Function 


Text, 


word of the 


(BF.Parse) 




ProcessStop 


string(sthiEvalString) 






List 


assuming that each word in 








the string is separated by a 








designated 








separator contained in the 








variable stSeparator. 








In most cases, use this 








function with 








DetermineNum Words. The 








function returns the wanted 








word. This function checks 








to make sure that 








ilnWordNum is within 








bounds so that i 








is not greater than the total 








number of words in string or 








less than/equal to zero. If it 








is out of bounds, we return 








empty string to indicate we 








can't get anything. We try to 








make sure this doesn't 








happen by calling 








DetermineNum Words first. 


ParseAndCle 


Private 


ParseMeetingTe 


This function first grabs the 


anPhrase 


Function 


xt 


word and send it to 


(BF.Parse) 






Clean Word in order strip 








the stuff that nobody wants. 








There are things in 
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parseWord that will kill 
the word, so we will need a 
method of looping through 
the body and rejecting 
words without killing the 
whole function 
i guess keep CleanWord and 
check a return value 
ok, now I have a word so I 
need to send it down the 
parse chain. This chain goes 
ParseCleanPhrase -> 
CleanWord -> 
EvaluateWord. If the word 
gets through the 
entire chain without being 
killed, it will be added at the 
end to om keyword string, 
first would be the function 
that checks for "/" as a 
delimiter and extracts the 
parts of that. This I will call 
"StitchFace" (Denise is more 
normal and calls it 
GetAWordFromString) 
if this finds words, then each 
of these will be sent, in turn, 
down the chain. If 
these get through the entire 
chain without being added or 
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killed then they will be 








added rather than tossed. 


FindMin 


Private 


ParseAndCleanP 


This function takes in 6 input 


(BF.Parse) 


Function 


hrase 


values and evaluates to see 








w^hat the minimum non 








zero value is. It first creates 








an array as a holder so that 








we can sort the five 








input values in ascending 








order. Thus the minimum 








value will be the first non 








zero value element of the 








array. If we go through 








entire array without finding 








a non zero value, we know 








that there is an error and we 








exit the function. 


CleanWord 


Private 


ParseAndCleanP 


This fimction tries to clean 


(BF.Parse) 


Function 


hrase 


up a word in a meeting text. 








It first of all determines if the 








string is of a valid length. It 








then passes it through a 








series of tests to see it is 








clean and when needed, it 








will edit the word and strip 








unnecessary characters off of 








it. Such tests includes 








getting rid of file extensions. 








non chars, numbers etc. 


EvaluateWor 


Private 


ParseAndCleanP 


This fianction tests to see if 
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d 


Function 


hrase 


this word is in the stop list so 


(BF.Parse) 






it can determine whether to 








eliminate the word from the 








original meeting text. If a 








word is not in the stoplist, it 








should stay around as a 








keyword and this ftmction 








exits beautifully with no 








errors. However, if the 








words is a stopword, an error 








must be returned. We must 








properly delimit the input 








test string so we don't 








accidentally retrieve sub 








strings. 


GoPattemMa 


Pubhc 


GoBF 


This procedure is called 


tch 


Sub 




when our QueryMethod is 


(BF.Pattem 






set to complex query 


Match) 






meaning we do want to do all 








the pattern matching stuff.It 








's a simple wrapper function 








which initializes some arrays 








and then invokes pattern 








matching on the title and the 








body. 


MatchPattem 


Public 


GoPattem Match 


This procedure loops through 


s 


Sub 




every pattern in the pattern 


(BF.Pattem 






table and tries to identify 


Match) 






different fields within a 








meeting text specified by 
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sInEvalString. For 
debugging purposes it also 
tries to tabulate how many 
times a certain pattern was 
triggered and stores it in 
gTabulateMatches to see 
whichp pattern fired the 
most. gTabulateMatches is 
stored as a global because we 
want to be able to run a batch 
file of 40 or 50 test strings 
and still be able to know how 
often a pattern was triggered. 



MatchAPatte 
m 

(BF.Pattem 
Match) 



Private 
Function 



MatchPattems 



This function goes through 
each element in the current 
pattern. It first evaluates to 
determine whether element is 
a placeholder or an indicator. 
If it is a placeholder, then it 
will try to bind the 
placeholder with some value. 
If it is an indicator, then we 
try to locate it. There is a 
trick however. Depending on 
whether we are at current 
element is the head of the 
pattern or 

not we want to take different 
actions. If we are at the head, 
we want to look for the 



63 









indicator or the placeholder. 








If we can't find it, then we 








know that the current pattern 








doesn't exist and we quit. 








However, if it is not the 








head, then we continue 








looking, because there may 








still be a head somewhere. 








We retry in this case. 


MatchMeetin 


Private 


MatchAPattem 


This function uses a big 


gField 


Function 




switch statement to first 


(BF.Pattem 






determine what kind of 


Match) 






placeholder we are talking 








about and depending on what 








type of placeholder, we have 








specific requirements 








and different binding criteria 








as specified in the 








subsequent fimctions called 








such as BindNames, 








BindTime etc. If binding is 








successfiil we add it to our 








guessing record. 


BindNames 


Private 


MatchMeetingFi 


In this function, we try to 


(BF.Pattem 


Function 


eld 


match names to the 


Match) 






corresponding placeholder 








$PEOPLE$. Names are 








defined as any consecutive 








two words which are 








capitalized. We also what to 
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retrieve a series of names 
which are comiected by and , 
or & so we look until we 
don't see any of these 3 
separators anymore. Note 
that we don 't want to bind 
single word names because it 
is probably 

too general anyway so we 
don't want to produce broad 
but irrelevant results. This 
function calls 

BindAFullName which binds 
one name so in a since 
BindNames collects all the 
results from BindAFullName 
BindAFullNa Private BindNames This function tries to bind a 

me Function full name. If the $PEOPLE$ 

(BF .Pattern placeholder is not the head of 

Match) the pattern, we know that it 

has to come right at the 
beginning of the test string 
because we've been deleting 
stuff off the head of the 
string all along. 
If it is the head, we search 
until we find something that 
looks like a full name. If we 
can't find it, then there's no 
such pattern in the text 
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entirely and we quit entirely 








from this pattern. This 








should eventually return us 








to the next pattern in 








MatchPattems. 


GetNextWor 


Private 


BindAFuU 


This function grabs the next 


dAfterWhite 


Function 


Name, 


word in a test string. It looks 


Space 




BindTime, 


for the next word after white 


(BF.Pattem 




BindCompanyTo 


spaces, @ or /. The word is 


Match) 




picLoc 


defined to end when we 
encounter another one of 
these white spaces or 
separators. 


BindTime 


Private 


MatchMeetingFi 


Get the immediate next word 


(BF.Pattem 


Function 


eld 


and see if it looks like a time 


Match) 






pattern. If so we've found a 
time and so we want to add it 
to the record. We probably 
should add more time 
patterns. But people don't 
seem to like to enter the time 
in their titles these days 
especially since we now have 
tools like OutLook. 


BindCompan 


Private 


MatchMeetingF i 


This function finds a 


yTopicLoc 


Function 


eld 


continuous capitalized string 


(BF.Pattem 






and binds it to stMatch 


Match) 






which is passed by reference 
from MatchMeetingField. A 
continuous capitalized string 
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is a sequence of capitalized 








words which are not 








intermpted 








by things hke , . etc. There's 








probably more stuff we can 








add to the list of 








interraptions. 


LocatePatter 


Private 


MatchAPattem 


This function tries to locate 


nHead 


Function 




an element which is an 


(BF.Pattem 






indicator. Note that this 


Match) 






indicator SHOULD BE AT 








THE HEAD of the pattem 








otherwise it would have gone 








to the function 








Locatehidicator instead. 








Therefore, we keep on 








grabbing the next word until 








either there's no word for us 








to grab (quit) or if we find 








one of the indicators we are 








looking for. 


ContainlnArr 


Private 


LocatePattem 


' This function is really 


ay 


Function 


Head, 


simple. It loops through all 


(BF.Pattem 




Locatelndicator 


the elements in the array 


Match) 






' to find a matching string. 


Locatehidica 


Private 


MatchAPattem 


This function tries to locate 


tor 


Function 




an element which is an 


(BF.Pattem 






indicator. Note that this 


Match) 






indicator is NOT at the head 








of the pattem otherwise it 
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would have gone to 








LocatePattemHead instead. 








Because of this, if our 








pattern is to be satisfied, the 








next word we grab HAS to 








be the indicator or else we 








would have failed. Thus we 








only grab one word, test to 








see if it is a valid indicator 








and then return result. 


InitializeGue 


Private 


MatchAPattem 


This function reinitializes 


ssesRecord 


Sub 




our temporary test structure 


(BF.Pattem 






because we have already 


Match) 






transferred the info to the 








permanent structure, we can 








reinitialize it so they each 








have one element 


AddToMeeti 


Private 


MatchAPattem 


This function is only called 


ngRecord 


Sub 




when we know that the 


(BF.Pattem 






information stored in 


Match) 






thiCurrGuesses is valid 








meaning that it represents 








legitimate guesses of 








meeting fields ready to be 








stored in the permanent 








record,tInMeetingRecord. 








We check to make sure that 








we do not store duplicates 








and we also what to clean up 








what we want to store so that 
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there's no cluttered crap such 








as punctuations, etc. The 








reason why we don't clean up 








until now is to save time. We 








don't waste resources calling 








ParseAndCleanPhrase until 








we know for sure that we are 








going to add it permanently. 


NoDuplicate 


Private 


AddToMeetingR 


This function loops through 


Entry 


Function 


ecord 


each element in the array to 


(BF.Pattem 






make sure that the test string 


Match) 






aString is not the same as 








any of the strings already 








stored 








in the array. Slightly 








different from 








ContainlnArray. 


SearchAltaVi 


Public 


GoBackGroundF 


This function prepares a 


sta 


Function 


inder 


query to be submitted to 


(BF.Search) 






AltaVista Search engine. It 








submits it and then parses the 








returning result in the 








appropriate format 








containing the title, URL and 








body/summary of each story 








retrieved. The number of 








stories retrieved is specified 








by the constant 








NUM_AV_STORIES. 








Lnportant variables include 
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stURLAlta Vista used to store 
query to submit 
stResultHTML used to store 
html jfrom page specified by 
stURLAltaVista. 



ConstructAlt 

aVistaURL 

(BF.Search) 



Private 
Function 



SearchAltaVista 



This function constructs the 
URL string for the alta vista 
search engine using the 
advanced query search mode. 
It includes the keywords to 
be used, the language and 
how we want to rank the 
search. Depending on 
whether we want to use the 
results of our pattern 
matching unit, we construct 
our query differently. 



ConstructSi 

mpleKeyWor 

d 

(BF.Search) 



Private 
Fimction 



ConstructAltaVi 
staURl, 

ConstructNewsP 
ageURL 



This function marches down 
the list of keywords stored in 
the stTitleKW or stBodyKW 
fields of the input meeting 
record and links them up into 
one string with each keyword 
separated by a connector as 
determined by the input 
variable sthiConnector. 
Returns this newly 
constructed string. 



ConstructCo 
mplexAVKe 



Private 
Function 



ConstructAltaVi 
staURL 



This function constructs the 
keywords to be send to the 
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yWord 






AltaVista site. Unlike 


(BF.Search) 






ConstructSimpleKeyWord 








which simply takes all the 








keywords from the title to 








form the query, this function 








will look at the results of BF 








's pattern matching process 








and see if we are able to 








identify any specific 








company names or topics for 








constructing 








the queries- Query will 








include company and topic 








identified and default to 








simple query if we cannot 








identify either company or 








topic. 


JoinWithCon 


Private 


ConstructCompl 


This function simply 


nectors 


Function 


exAVKey 


replaces the spaces between 


(BF.Search) 




Word, 


the words within the string 






ConstructCompl 


with a connector which is 






exNPKey 


specified by the input. 






Word, 








RefmeWith 








Rank 




RefmeWithD 


Private 


ConstructAltaVi 


This function constructs the 


ate (NOT 


Function 


staURL 


date portion of the alta vista 


CALLED 






query and returns this portion 


AT THE 






of the URL as a string. It 


MOMENT) 






makes sure that alta vista 



71 



(BF. Search) 



searches for articles within 
the past PAST_NDAYS. 



RefineWithR 
ank 

(BF.Search) 



Private 
Function 



ConstructAltaVi 
staURL 



This function constructs the 
string needed to passed to 
AltaVista in order to rank an 
advanced query search. If 
we are constructing the 
simple query we will take in 
all the keywords from the 
title. For the complex query, 
we will take in words from 
company and topic, much the 
same way we formed the 
query in 

ConstructComplexAVKeyW 
ord. 



IdentifyBloc 
k 

(BF .Parse) 



Public 
Function 



SearchAlta Vista, 
S earchNewsPage 



This ftmction extracts the 
block within a string marked 
by the beginning and the 
ending tag given as inputs 
starting at a certain 
location(iStart). The block 
retrieved does not include the 
tags themselves. If the block 
cannot be identified with the 
specified delimiters, we 
return unsuccessful through 
the parameter 

iRetumSuccess passed to use 
by reference. The return type 
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is the block retrieved. 



IsOpenURL 

Error 

(BF.Error) 



Public 
Function 



SearchAltaVista, 
SearchNewsPage 



This function determines 
whether the error 
encountered is that of a 
timeout error. It restores the 
mouse to default arrow and 
then returns true if it is a 
time out or false otherwise. 



SearchNews 
Page 

(BF.Search) 



Pubhc 
Function 



GoB ackGroundF 
inder 



This function prepares a 
query to be submitted to 
NewsPage Search 
engine. It submits it and then 
parses the returning result in 
the appropriate format 
containing the title, URL and 
body/summary of each story 
retrieved. The number of 
stories retrieved is specified 
by the constant 
UM NP STORIES 



ConstructNe 
wsPageURL 
(BF.Search) 



Private 
Function 



SearchNewsPage 



This function constructs the 
URL to send to the 
NewsPage site. It uses the 
information contained in the 
input meeting record to 
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determine what keywords to 








use. Also depending whether 








we want simple or complex 








query, we call different 








functions to form strings. 


ConstructCo 


Private 


ConstructNewsP 


This function constructs the 


mplexNPKey 


Function 


ageURL 


keywords to be send to the 


Word 






NewsPage site. 


(BF.Search) 






UnlikeConstructKeyWordStr 








ing which simply takes all 








the keywords from the title 








to form the query, this 








function will look at the 








results of BF 's pattern 








matching process and see if 








we are able to identify any 








specific company names or 








topics for constructing 








the queries. 


ConstructOv 


Private 


GoBackGroimdF 


This function takes in as 


erallResult 


Function 


inder 


input an array of strings 


(BF.Main) 






(sthiStories) and a 








MeetingRecord which stores 








the information for the 








current meeting. Each 








element in the array stores 








the stories retrieved from 








each information source. 








The function simply 








constructs the appropriate 
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output to send to Munin 








including a return message 








type to let Munin know that 








it is the BF responding and 








also the original user_id and 








meeting title so Munin 








knows which meeting BF is 








talking about. 


CoimectAnd 


Public 


GoBackGroundF 


This function allows 


TransferTo 


Sub 


inder 


Background Finder to 


Munin 






connect to Munin and 


(BF.Main) 






eventually transport 








information to Munin. We 








will be using the UDP 








protocol instead of the TCP 








protocol so we have to set up 








the remote host and port 








correctly. We use a global 








string to store gResult 








Overall because although it 








is unnecessary with UDP, it 








is needed with TCP and if we 








ever switch back don't want 








to change code. 


DisconnectFr 


Public 




omMuninAn 
d 


Sub 




Quit 






1 (BF.Main) 
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Figure 24 shows a flowchart of the actual code utilized to prepare and submit 
searches to the Aha Vista and NewsPage search engines in accordance with an 
embodiment of the present invention. Processing commences at function block 

5 2401 where a command line is utihzed to update a calendar entry with specific 

calendar information. The message is next posted in accordance with function block 
2402 and a meeting record is created to store the current meeting information in 
accordance with function block 2403. Then, in function block 2404 the query is 
submitted to the Alta Vista search engine and in function block 2405, the query is 

10 submitted to the NewsPage search engine. When a message is returned from the 

search engine, it is stored in a results data structure as shown in function block 2406 
and the information is processed and stored in summary form in a file for use in 
preparation for the meeting as detailed in function block 2407. 

1 5 Figure 25 provides more detail on creating the query in accordance with an 

embodiment of the present invention. Processing commences at function block 
2505 where the meeting record is parsed to obtain potential companies, people, 
topics, location and a time. Then, in function block 2506, at least one topic is 
identified and in function block 2507, at least one company name is identified and 

20 finally in function block 2508, a decision is made on what material to transmit to the 
file for ultimate consiraiption by the user. 

Figure 26 is a variation on the query theme presented in Figure 25. A meeting 
record is parsed in function block 2600, a company is identified in function block 
25 2620, a topic is identified in function block 2630 and finally in function block 2640 
the topic and or the company is utilized in formulating the query. 

While various embodiments have been described above, it should be understood that 
they have been presented by way of example only, and not limitation. Thus, the 
30 breadth and scope of a preferred embodiment should not be limited by any of the 
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above described exemplary embodiments, but should be defined only in accordance 
with the following claims and their equivalents. 
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CLAIMS 

What is claimed is: 

1 1 . A method for a knowledgebase framework, comprising the steps of: 

2 (a) obtaining information from at least one source utilizing a network; 

3 (b) generating an index for the obtained information, wherein the generated 

4 index comprises a plurality of items each associated with at least some of the 

5 obtained information; 

6 (c) displaying the index to a user utilizing the network; 

7 (d) permitting the user to select an least one of the items of the index; and 

8 (e) displaying the information associated with the selected item to the user 

9 utilizing the network. 

1 2. A method as recited in claim 1 , wherein the at least one source comprises an 

2 internal source. 



1 3. A method as recited in claim 1 , wherein the at least one source comprises an 

2 external source accessible utilizing a wide are network. 

1 4. A method as recited in claim 1, wherein the information obtained from the 

2 sources includes pharmaceutical information. 

1 5. A method as recited in claim 1 , ftirther comprising the steps of monitoring at 

2 least one of the sources utilizing the network for updated information relating 

3 to at least one of the items of the index, generating a notice regarding the 

4 updated information, and fransmitting the notice to the user utilizing the 

5 network. 



1 
2 



6. A method as recited in claim 5, wherein the user selects the at least one 
source to be monitored. 
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1 7. A method as recited in claim 1, further comprising the steps of permitting the 

2 user to input a search term utihzing the network, searching the index for 

3 items associated with the search term, and displaying items of the index 

4 associated with the search term to the user utilizing the network. 

1 8. A method as recited in claim 1 , wherein displaying the information 

2 associated with the selected item or entry to the user includes utilizing the 

3 network to retrieve the associated information from the source from which 

4 the associated information was obtained. 

1 9. A method as recited in claim 1 , wherein the network is capable of 

2 communicating using TCP/IP protocol. 

1 10. A computer program embodied on a computer readable medium for a 

2 knowledgebase framework, comprising: 

3 (a) a code segment that obtains information from at least one source utilizing a 

4 network; 

5 (b) a code segment that generates an index for the obtained information, wherein 

6 the generated index comprises a plurality of items each associated with at 

7 least some of the obtained information; 

8 (c) a code segment that displays the index to a user utilizing the network; 

9 (d) a code segment that permits the user to select an least one of the items of the 

10 index; and 

11 (e) a code segment that displays the information associated with the selected 

12 item to the user utilizing the network. 

1 11. A computer program as recited in claim 1 0, wherein the at least one source 

2 comprises an internal soiirce. 



-79- 



1 12. A computer program as recited in claim 1 0, wherein the at least one source 

2 comprises an external source accessible utilizing a wide are network. 

1 13. A computer program as recited in claim 1 0, wherein the information 

2 obtained from the sources includes pharmaceutical information. 

1 14. A computer program as recited in claim 1 0, ftirther comprising a code 

2 segment that monitors at least one of the sources utilizing the network for 

3 updated information relating to at least one of the items of the index, a code 

4 segment that generates a notice regarding the updated information, and a 

5 code segment that transmits the notice to the user utilizing the network. 

1 15. A computer program as recited in claim 14, wherein the user selects the at 

2 least one source to be monitored. 

1 16. A computer program as recited in claim 10, further comprising a code 

2 segment that permits the user to input a search term utihzing the network, a 

3 code segment that searches the index for items associated with the search 

4 term, and a code segment that displays items of the index associated with the 

5 search term to the user utilizing the network. 

1 17. A computer program as recited in claim 1 0, wherein displaying the 

2 information associated with the selected item or entry to the user includes 

3 utilizing the network to retrieve the associated information from the source 

4 from which the associated information was obtained. 

1 18. A computer program as recited in claim 1 0, wherein the network is capable 

2 of communicating using TCP/IP protocol. 

1 19. A system for a knowledgebase framework, comprising: 



-80- 



2 (a) logic that obtains information from at least one source utilizing a network; 

3 (b) logic that generates an index for the obtained information, wherein the 

4 generated index comprises a plurality of items each associated with at least 

5 some of the obtained information; 

6 (c) logic that displays the index to a user utilizing the network; 

7 (d) logic that permits the user to select an least one of the items of the index; and 

8 (e) logic that displays the information associated with the selected item to the 

9 user utilizing the network. 

1 20. A system as recited in claim 19, further comprising logic that monitors at 

2 least one of the sources utiUzing the network for updated information relating 

3 to at least one of the items of the index, logic that generates a notice 

4 regarding the updated information, and logic that transmits the notice to the 

5 user utilizing the network. 
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SYSTEM, METHOD, AND ARTICLE OF MANUFACTURE FOR A 
KNOWLEDGEBASE FRAMEWORK 

ABSTRACT 

A system, method and article of manufacture are provided for a knowledgebase 
framework. Information is obtained from at least one source utilizing a network. 
Utilizing a knowledge model, an index is generated for the obtained information. 
The generated index includes a plurality of items each associated with at least some 
of the obtained information. UtiUzing the network, the generated index is displayed 
to a user. The user is permitted to select an least one of the items of the index. The 
information associated with the selected item is then displayed to the user utilizing 
the network. 
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OBTAINING INFORMATION FROM AT LEAST ONE SOURCE 
UTILIZING A NETWORK 



GENERATING AN INDEX FOR THE OBTAINED INFORMATION, 
WHEREIN THE GENERATED INDEX COMPRISES A PLURALITY OF 
ITEMS EACH ASSOCIATED WITH AT LEAST SOME OF THE 
OBTAINED INFORMATION 
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DISPLAYING THE INDEX TO A USER UTILIZING THE NETWORK 



PERMITTING THE USER TO SELECT AN LEAST ONE OF THE ITEMS 
OF THE INDEX 



DISPLAYING THE INFORMATION ASSOCIATED WITH THE 
SELECTED ITEM TO THE USER UTILIZING THE NETWORK 
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FIG. 5 
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ACCESSING AT LEAST ONE SOURCE UTILIZING A NETWORK 
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EXTRACTING INFORMATION FROM THE SOURCE UTILIZING THE 
NETWORK 



CONVERTING THE FORMAT OF THE EXTRACTED INFORMATION 
TO A COMMON FORMAT 



GENERATING AN INDEX FOR THE INFORMATION UTILIZING A 
KNOWLEDGE MODEL 



STORING THE GENERATED INDEX IN A DATABASE 



FIG. 6 



o 




1700 



17.02 



RECEIVING INFORMATION RELATING TO A TARGET TO BE 
TRACKED FROM A USER UTILIZING A NETWORK 



MONITORING THE NETWORK FOR CHANGES TO THE TARGET 



RETRIEVING DATA RELATING TO THE MONITORED CHANGES 
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TRANSMITTING THE RETRIEVED DATA TO THE USER UTILIZING 
THE NETWORK 



FIG. 17 
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Public Function 
ParseMeetin^ext 

(Extracts ke^ords from meeting record) 

2050 



Public Function 
GoFatternMatch 
(Initiates patten matching) 

2060 



Public Function 
SearchAIfaVista 

(PaneResalts) 
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Public Function 
SearchNewsPage 
(Query and Parse Results) 
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Private Function 
ConstructOverallResult 

(Prepares data) 
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Public Sub 
ConnectAndTransferToMunin 
(Sends data to Mnnin) 
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Built-in Function 
WinsodcSendData 
(Sends data through UDP) 
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Public Sub 
DisconnectFromMuninAndQuit 

(Once data is sent, clean program and exit) 

2097 
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GoBac^oiffldFindff 
(Wrapper FDicdoii} 

2110 



Public Fnndion 
ParseMeefingTeit 
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Private Function 
FanmlSeliniitation 
(Make suit M deUters ait plaul comctly] 
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Public Function 
MenmneNnniWords 
(Rttiini nimlKr of mrds in sbinf separator) 
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Public Function 
GetAWordFromString 
(Gttitb field iraiiiipgtiiuefin'ttit) 
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Private Function 
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(Eilracis a word at a tiie from title aid My) 



Private Function 
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Private Function 
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EvaluateWord 
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InltializeGuessesRecord 
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Main 
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GoBackgroundFinder 
(Wrapper Function) 
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Public Function 
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Public Function 
IsOpenURLError 
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Private Function 
ConstractSimpleKeyWord 
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Private Function 
ConstructCon^lexAVKeyWord 
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Private Function 
JoinWithConnectors 
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Private Function 
JoinWithConnectors 
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Public Function 
SearchNewsPage 
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COMMAND LINE 
7 userjd, meeting title, meeting body, list, location, time" 








2402 — 


MESSAGE 

"userjd, meeting title, meeting body, participant list, time" 
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MEETING RECORD TO STORE CURRENT MEETING INFORMATION 
stUSERID, sTItleOrig, sTitleKW, sBodyKW, sLocation, sTime, 
sParticipants( ), sMeetingText: original message minus userjd 
sCompany, sPeople, sTopic, sWhen, sWhere from GoPatternMatcn 
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SUBMIT QUERY TO ALTA VISTA 
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SUBMIT QUERY TO NEWSPAGE 
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STORE MESSAGE IN gResultOverall 
"msgjd, userjd, meeting title concatenated with stories 
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PROCESS STORIES FROM ALTA VISTA AND NEWSPAGE 



FIG. 24 
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A MEETING RECORD - POTENTIAL COMPANIES, PEOPLE, TOPICS, 
LOCATION AND A TIME ARE IDENTIFIED 



2506- 



AT LEAST ONE TOPIC IS IDENTIFIED 



2507 — AT LEAST ONE COMPANY NAME IS IDENTIFIED 



2508- 



A DECISION IS MADE ON WHAT MATERIAL TO TRANSMIT 
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FIG. 25 
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A MEETING RECORD - POTENTIAL COMPANIES, PEOPLE, TOPICS, 
LOCATION AND A TIME ARE IDENTIFIED 






AT LEAST ONE COMPA 


NY NAME IS IDENTIFIED 
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AT LEAST ONE TOPIC IS IDENTIFIED 
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USE THE TOPIC AND OR THE COMPANY 



FIGURE 26 



